f 



WORLD INTELLECTUAL PROPERTY ORGAN; 
International Bureau 



INTERNATIONAL APHICATION PUBLISHED UNDER TOE PA 




COOPERATION TREATY (PCI) 



(51) International Patent Qassification S - 




(11) IntenatioDal Publication NumiMr: WO 91/16421 


C12N9/14, 15/29, 15/55 
C12N 5/04, AOIH 4/00 


Al 


(43) iDtatnatioDal PublitatioD Date: 31 October 1991 (31.10.91) 


C07K 13/00 





(21) International Application Number: PCT/US91/02960 

(22) International Filing Date: 25 April 199! (25.04.91) 



(30) Priority data: 
514,030 
620.426 
662,007 



26 Aprill99J) (26.04.90) US 
30 November 1990 (30.11.90) US 

27 February 1991 (27.0191) US 



(71) Applicant (for al! designated States except US): C ALGENE, 

INC. [US/US]; 1920 Fifth Street, Davis, CA 95616 (US). 

(72) Inventors; and 

(7^ Inventors/Applicants (for US only) : D AVIES, Huw. Maelor 
[GB/USI; 307 Grande Avenue, Davis, CA 95616 (US). 
POLLARD, Michael, Roman [GB/USJ; 5649 East Buck- 
eye Road, Madison, WI 53716 (US). VOELKER, Toni, 
Alois [DE/US]; 1206 Covell Place, Davis, CA 95616 
(US). THOMPSON, Gregory, A. [US/US]; 5127 Cowel 
Blvd., Davis, CA 95616 (US). 



(74) Agents: LASSEN, Elizabeth et al.; Calgene. Inc., 1920 
Fifth Street, Davis, CA 95616 (US). 



(81) Designated States: AT (European patent), BE (European 
patent), CA, CH (European patent), DE (European pa- 
tent), DK (European patent), ES (European patent), PR 
(European patent), GB (European patent), GR (Euro- 
pean patent), IT (European patent), JP, LU (European 
patent), NL (European patent), S£ (European patent), 
US. 



Pnblisfaed 

With international search report. 
Before the expiration of the time limit for amending the 
claims and to be republished in the event of the receipt of 
amendments. 



(54) Title: PLANT THIOESTERASES 



(57) Abstract 

This invention relates to plant thioesterases, means to identify such proteins, amino acid and nucleic acid sequences asso- 
ciated with such protein, methods to obtain, make and/or use such plant thioesterases. Also, by this invention, the existence of a 
heretofore unproven factor critical to the biosynthesis of medium-chain fatty acids in plants is demonstrated. 



FOR THE PURPOSES OF INFORMATION ONLY 



Codes used to identify States party to the PCT on the front pages of pamphlets publishing international 
applications under the PCT. 



AT 


Austria 


ES 


Spain 


MG 


Madaga«»r 


AU 


Ausiralia 


PI 


inland 


ML 


Malt 


BB 


Barbados 


PR 


France 


MN 


Mongolia 


BB 


Belgiuni 


GA 


Gabon 


MR 


Mauritania 


BP 


Burkina Faso 


GB 


United Kingdom 


MW 


Malawi 


BG 


Bulgaria 


GN 


Guinea 


NL 


Netherlands 


BJ 


Benin 


GR 


Greece 


NO 


Norway 


BR 


Brazil 


HU 


Hungary 


PL 


Poland 


CA 


Canada 


IT 


Italy 


RO 


Romania 


CF 


Central African Republic 


JP 


Japan 


SD 


Sudan 


CG 


Congp 


KP 


Democratic People's Republic 


SG 


Sweden 


CH 


SwilwrJand 




of Korea 


SN 


Senegal 


CI 


C611B d'lvotru 


KR 


Republic of Korea 


SU 


Soviet Union 


CM 


Cameroon 


LI 


Ucchlcnsltiin 


TD 


Chad 


CS 


C^hoslovakia 


LK 


Sri Lanka 


TG 


Togo 


DB 


Oerniany 


LU 


Luxembourg 


US 


United Staua of America 


DK 


Denmark 


MC 


Monaco 







PLANT THIQESTERASES 

5 

This application is a continuation-in-part of USSN 
07/662,007 filed February 27, 1991, and a continuation-in- 
part of USSN 07/514,030 filed April 26, 1990 and a 
continuation-in-part of USSN 07/520,426 filed November 30, 
10 1990. 

Tg^nhnical Field 

The present invention is directed to protein 
preparations, amino acid and nucleic acid sequences and 
15 constructs, and methods related thereto, 

INTRODUCTION 

B<ackgrovind 

"Improved" means to obtain or manipulate fatty acid 
20 compositions, from biosynthetic or natural plant sources, 
are needed. For example, novel oil products, improved 
sources of synthetic triacylglycerols (triglycerides), 
alternative sources of commercial oils, especially tropical 
oils (i.e., palm kernel and coconut oils) , and plant oils 
25 found in trace amounts from -natural sources are desired for 
a variety of industrial and food uses . 

To this end, the Fatty Acid Synthesis (FAS) system in 
plants and bacteria, FAS-II, has been studied. The 
mechanism of producing "long-chain fatty acids", i.e., 
30 fatty acids having a carbon chain length of equal to or 
greater than 16 carbon atoms (C16), via the acyl carrier 



protein (ACP) -dependent, plastid-localized FAS system of 
plants is relatively well characterized. However, the 
amino acid and corresponding niacleic acid sequences of many 
of the proteins responsible for this activity have not been 
determined. In particular, the enzyme by which free long- 
chain fatty acids are produced has been studied in several 
different crops. Nevertheless, the mechanism(s) by which 
plants produce fatty acids having shorter carbon chains, 
i.e., less than C16 atoms, including short-chain free fatty 
acidy (C4-C8) and medium-chain free fatty acids (C8-C14), 
has remained elusive. 

Characterization of thioest erases (also known as 
hydrolases) would be useful for the further study of plant 
FAS systems and for the development of novel and/or 
alternative oils sources. Generating a spread of C4, C6 
and C8 short chain 3-keto fatty acids could become a key 
improvement in polyhydroxybutyrate (PHB) -based 
biodegradable plastics made in bacteria and plants. 
Medium-chain fatty acids have special importance in the 
detergent and lubricant industries or in the formulation of 
edible oils with reduced caloric value or other health 
benefits. See for example, U.S. Pat. No. 4,863,753 and 
Barch, k.C. & Babayan, V.K., Azn. J. Clin. Nat. (1982) 
35:950-962. Longer chain fatty acids may have certain 
other utilities, i.e., C15 and C18 have particular uses in 
margarine and other solid oil-based products and very long 
chain fatty acids also have specialized uses, i.e., C22 is 
used to make peanut butter smoother. As such, a ready 
source of a variety of fatty acid lengths, including 



3tora,e lipi., „,,^ inco.po.atea .i«e.i„, chain 

length fatty aclda in ..^i.e. ratios, are .esired for a 
variety of industrial anC food use fields. As the 
Mosynthetic pathway for chain ter^nation of fatty acids 

' in plants is determlBcrt m. 

termmed, the system can be adapted for 

application in vivo and in vitro. 

Ihus, studies of Plant chain t.mdnation .echanisn. 
provide means to further enhance, control, modify or 
otherwise alter the length of fatty acid products or 

10 resulting triglycerides and oils l^H , 

elucidation of 

the factor (s) critical 

critical to the natural production of free 

fatty acids in plants is desired, including the 
purification of such factors and the characterisation of 
e a^t,s. and/or co-factors which enhance the efficiency 
Of the system. Of special interest are the nucleic acid 
se^ences of genes encoding factors related to the 
production of such free fatty acids for applications in 
genetic engineering. 

20 ReiPv^nt T.lr-rnriiM 

McKeon, T.A. & Stumpf, p.k j sim 

f , r.r.., cT. Biol. Chem. (1982) 

^57..1.l.l-ui„ reports a VOO-fold purification of 
safflower acyl ACP-thioesterase. other references 
reporting the purification and oharacteri.ation of long- 
Chain acyl-.cP thioesterases include Shire, et al., ^ch 
^lochem. Mcptys. ,1976) 17^:110-116; Ohlrooge, et al 
^C. Biocte. Biop,ys. ,„,8, 1«:382-3S1.. Imai, et ai' 
-iant Xipi. .iocwstry, .he Kinth Xnternaticnal Sy^:ium 
on Plant Upids, Wye College, Univ. of .ondon, auly 8-13 



(1990); Hellyer, A. & Slabas, A.R., Plant Lipid 
BiochemistrYf The Ninth International Symposium on Plant 
Lipids, Wye College, Univ. of London, July 8-13 (1990) • 

P.K.Stumpf, The Biochemistry df Plants (P.K. Stumpf & 
5 E. Conn, eds.) (1987) 5:121-136, summarizes mechanisms of 
termination of the fatty acid chain elongation pathway of a 
variety of chain-lengths in plants. Specific thioesterases 
for producing medium-chain fatty acids are postulated as 
well as other possible explanations. Harwood, J*L., Ann. 

10 J^ev. Plant Physiol. Mol. Bio. (1988) 35:101-138, references 
various possibilities in the literature regarding 
production of large amounts of medium-chain length fatty 
acids in some plant tissues and reports that all attempts 
to find a "suitable thioesterase" responsible for medium- 

15 chain fatty acid production have proved negative. Harwood, 
J.L., Grit. Rev. Plant Sci. (1989) 8:1-43, reviews current 
information ' regarding the production of medium-chain fatty 
acids in plants with the conclusion that very little is 
known. See also, Pollard, M.R, and Singh, S.S., The 

20 Wetaboiisin, Structure and Function of Plant Lipids, Stumpf, 
P.K,, Mudd, J.B., and Nes, W.D., eds. (Plenum Press, NY 
1987) pp. 455-463. 

BRIEF DESCRIPTION OF THE FIGURES 
25 Figure 1. Two peptide sequences and the degenerate 

oligonucleotides used in the PCR reaction to obtain the Bay 
thioesterase sequence are shown. "I" in the 
oligonucleotide sequences represents the nucleotide 
inosine. The lower case DNA sequence represent artificial 



5* ends designed to allow for subsequent cloning with the 
two chosen restriction enzymes (restriction sties 
underlined) . The oligonucleotide for peptide 701 (SEQ ID 
NO: 14) is SEQ ID NO: 32 and for peptide 698 (SEQ ID NO: 
12) is SEQ ID NO: 33. 

Figure 2. A fusion of both the PGR generated cDNA and 
the longest library clone of the Bay thioesterase is shown. 
The first 210 bases (SEQ ID NO: 34) are from the 800 bp PGR 
product. The gap represents unsequenced DNA, about 240 bp, 
as determined by restriction enzyme mapping. The remaining 
sequence (SEQ ID NO: 35) is from the PGR fragment and the 
library clone. Translation into the proper frame is shown 
under the sequence. Selected peptide sequences are 
depicted by horizontal lines under the respective protein 
sequence. Numbers shown correspond to those provided in 
Table 8. Mismatches with the sequence provided through 
protein sequencing are shown. 

Figure 3 . A sequence comparison is shown between two 
related Bay thioesterase cDNA clones isolated using the 800 
bp ?CR-generated fragment described in Example 14 ,0.2 . 
Sequence identity is shown by horizontal lines. 

Figure 4 . A full length sequence of a Bay 
thioesterase is shown. In Figure 4A, the amino acid 
sequence of the structural gene is given. In Figure 4B, 
the nucleic acid sequence is given. The amino acid 
sequence in Figure 4 A beings with the ATG codon at 181 to 
183 . As noted elsewhere in the specification, three 
possible ATG start codons are located in the first 219 base 
pairs of the nucleic acid sequence of Figure 4B. 



Figure 5 • A representation of the orientation of the 
fragments shown in Table 9 is provided. 

5 SUMMARY OF THE INVENTION 

This invention relates to plant thioesterases and 
encompasses both shorter-chain preferring and longer-chain 
preferring acyl-carrier substrate thioesterases - 
Especially of interest are conserved amino acid or nucleic 

10 acid sequences between such shorter-chain preferring and 
such longer-chain preferring acyl-carrier thioesterases . 
Methods are use such conserved sequences to obtain plant 
thioesterases is described. 

In a first embodiment, this invention is directed to 

15 nucleic acid sequences which encode a plant thioesterase. 
This includes sequences which encode biologically active 
plant thioesterases as well as sequences which are to be 
used as probes^ vectors for transformation or cloning 
intermediates. Biologically active sequences may be found 

20 in a sense or anti-sense orientation as to transcriptional 
regulatory regions found in various constructs. The plant 
thioesterase encoding sequence may encode a complete or 
partial sequence depending upon the intended use. All or a 
portion of the genomic sequence, cDNA sequence/^ precursor 

25 plant thioesterase or mature plant thioesterase is 
intended. 

Of special interest are recombinant DNA constructs 
which can provide for the transcription or transcription 
and translation (expression) of the plant thioesterase 



sequence. In particular, constructs which are capable of 
transcription or transcription and translation in plant 
host cells are preferred. Such construct may contain a 
variety of regulatory regions including transcriptional 
initiation regions obtained from genes' preferentially 
expressed in plant seed tissue. 

In a second aspect, this invention relates to the 
presence of such constructs in host cells, especially plant 
host cells. 

In a different aspect, this invention relates to 
transgenic host cells which have an expressed plant 
thioesterase therein. 

In yet a different aspect, this invention relates to a 
method for producing a plant thioesterase in a host cell or 
progeny thereof via the expression of a construct in the 
cell. Cells containing a plant thioesterase as a result of 
the production of the plant thioesterase encoding sequence 
are also contemplated herein. 

In a different embodiment, this invention relates to 
methods of using a DNA sequence encoding a plant 
thioesterase for the modification of the proportion of free 
fatty acids produced within a cell, especially plant cells. 
Plant cells having such a modified free fatty acid 
composition are also contemplated herein. 

In a further aspect of this invention, medium-chain 
preferring plant thioesterase proteins and sequences which 
are related thereto, including amino acid and nucleic acid 
sequences, are contemplated. Medium-chain preferring acyl- 
carrier thioesterases substantially free of other plant 



proteins are described. Medium-chain preferring acyl- 
carrier thioesterases which demonstrate preferential 
hydrolysis activity toward acyl-acyl carrier protein (ACP) 
substrates are of particular interest. Nucleic acid 
5 sequences and amino acid sequences of such proteins are 
described. 

In addition, methods to produce a medium-chain free 
fatty acid utilizing a medium-chain preferring acyl-carrier 
thioesterase is provided. 
10 Plant thioesterases exemplified herein include an 

Dmbellularla callfornica (Bay), Cuphea hookeriana (Cuphea) 
and Carthamus tinctiorius (saf flower) thioesterases. These 
exemplified thioesterases may be used to obtain other plant 
thioesterases of this invention. 

15 

DETAILED DESCRIPTION OF THE INVENTION 

A plant thioesterase of this invention includes any 
sequence of amino acids, such as a protein, polypeptide or 
peptide fragment obtainable from a plant source which 

20 demonstrates the ability to catalyze the production of free 
fatty acid{s) from fatty acyl-carrier substrates under 
plant enzyme reactive conditions. By "enzyme reactive 
conditions" is meant that any necessary conditions are 
available in an environment (i.e., such factors as 

25 temperature, pH, lack of inhibiting substances) which will 
permit the enzyme to function. 

Preferential activity of a plant thioesterase toward a 
particular chain-length fatty acyl-carrier substrate is 
determined upon comparison of free fatty acid product 



amounts obtained per different chain length substrates. 
For example, by "C12 preferring" is meant that the 
hydrolytic activity of the enzyme preparation demonstrates 
a preference for lauroyl, and perhaps decanoyl, over other 
substrates of different acyl carbon lengths. In a like 
fashion, a plant thioesterase having "CIO preferring" 
activity will show higher levels of activity toward 
decanoyl substrates, and perhaps octanoyl, over other 
substrates of different acyl carbon lengths. It is noted 
that some activity, of a significantly lesser magnitude, 
may be observed against other chain-length fatty acyl 
substrates, i.e,, the specificity will be substantial, but 
may not be absolute. 

As noted above, a plant thioesterase of this invention 
will display activity toward fatty acyl-carrier substrates. 
During biosynthesis of lipids in a plant cell, fatty acids 
are typically covalently bound to AGP ^ or coenzyme A (CoA) 
carriers. Plant thioesterases which display preferential 
activity toward acyl- AGP substrates are especially 
preferred because they are likely to be closely associated 
with the FAS pathway in immature embryo plastids . However, 
activity toward acyl-CoA substrates or other synthetic 
substrates, for example, is also contemplated herein. 

Other plant thioesterases are obtainable from the 
specific exemplified protein preparations and sequences 
provided herein. Furthermore, it will be apparent that one 
can obtain natural and synthetic plant thioesterases, 
including modified amino acid sequences and starting 
materials for synthetic-protein modeling from the 



-e^l«ie. pu.t thloesterases ana f.o. pla„t 
t^ioestera^es „Mch a« o.tai„ea throu,, the use of .„ch 
e«^U«ea ,.^.„ee3. «o^«e. a:„.„o a=i. ae^ences 
-clude ae^ances „Mch ta.e bean stated, truncated, 
increaaed and t.e lUe, whether auch aequences were 
partially or wholly syntheaized. Se^ence, which are 
-uaily purified .ro. plant preparations or are identical 
or encode identical proteins thereto, regardless of the 
-hod used to Obtain the protein or se^ence, are e^iiy 
10 considered naturally derived. 

^ T^us, one Skilled in the art will readily reco^i.e 
2^ antibody preparations, nucleic acid probes and 

'"-^ P^P-d and used to screen and 

recover "homologous" or "rel,t.rf" 
„ 5< or related" thioesterases from a 

" variety of plant sources t„„<„.,i . 

urces. Typically, nucleic acid probes 

are labeled tc allow detection, preferably with 
radioactivity although enzymes or other methods may also be 
-ed. Por i^nolcgical screening methods, antibody 
preparations either monoclonal or polyclonal are utilized. 
Polyclonal antibodies, although less specific, typically 
are more useful in gene isolation. Per detection, the 
antibody is labeled using radioactivity or any one of a 
variety of second antibody/enzyme conjugate systems that 
are commercially available. Bxanples of some of the 
« available antibody detection systems are described by 

Oberfilder ,.ocus ,1S89, BH. Ufe Technologies, Inc., ill 
5). ■'■ 

Homologous sequences are found when there is an 
identity Of science and may be determined upon con^arison 



of sequence information, nucleic acid or amino acid, or 
through hybridization reactions between a known 

thioesterase and a candidate source. Conservative changes, \ 

such as Glu/Asp, Val/Ile, Ser/Thr, Arg/Lys and Gln/Asn may | 

1 

5 also be considered in determining sequence homology. I 
Typically, a lengthy nucleic acid sequence may show as ^ 
little as 50-60% sequence identity, and more preferably at 
least about 70% sequence identity, between the target 
sequence and the. given plant thioesterase of interest 

10 excluding any deletions which may be present, and still be L 
considered related. Amino acid sequences are considered | 
homologous by as little as 25% sequence identity between i 
the two complete mature proteins . (See generally, 
Doolittle, R.F,, OF URFS and ORFS (University Science 

15 Books, CA, 1986.) 

A genomic or other appropriate library prepared from 
the candidate plant source of interest may be probed with 
conserved sequences from the plant thioesterase to identify ^ 

r 

homologously related sequences. Use of an entire cDNA or ^ 
20 other sequence may be employed if shorter probe sequences 1 
are not identified. Positive clones are then analyzed by 
restriction enzyme digestion and/or sequencing. When a 
genomic library is used, one or more sequences may be 
identified providing both the coding region, as well as the 
25 transcriptional regulatory elements of the thioesterase 
gene from such plant source. Probes can also be 
considerably shorter than the entire sequence. 
Oligonucletodies may be used, for example, but should be at 
least about 10, preferably at least about 15, more 



1 K^ifv^yifuz^^ 

preferably at least 20 nucleotides in length. Wlxen shorter 
length regions are used for comparison, a higher degree of 

sequence identity is required than for longer sequences. i 

f 

Shorter probes are often particularly useful for polymerase | 
5 chain reactions (PGR) , especially when highly conserved I 
sequences can be identified. {See, Gould, et al., PNAS USA 
(1989) 8ff:1934-1938.) 

When longer nucleic acid fragments are employed (>100 
bp) as probes, especially when using complete or large cDNA 

10 sequences, one would screen with low stringencies (for i 
example 40-50°C below the melting temperature of the probe) | 
in order to obtain signal from the target sample with 20- _ 
50% deviation, i.e., homologous sequences. (See, Beltz, et 
ai* Methods in Enzymology (1983) 200:266-285,). 

15 In a preferred embodiment , a plant thioesterase of 

this invention will have at least about 30% sequence 
identity, and more preferable at least about 50% sequence 
identity with at least a sequence of 8 amino acids of an 
exemplified plant thioesterase or a plant thioesterase 3 

20 which has in turn been obtained from a plant thioesterase L 
sequence. Alternatively, a plant thioesterase of this 
invention will have at least about 65% sequence identity 
and more preferably at least about 75% sequence homology 
with an exemplified plant thioesterase or a plant 

25 thioesterase which in turn has been obtained from a given 
plant thioesterase sequence. In particular, thioesterases 
which are obtainable from an amino acid or nucleic acid 
sequence of a Bay thioesterase (See, Fig. 4A or B) or a 
saf flower amino acid fragment of Table 9 (Example 14, 



infra) are especially preferred. The plant thioesterase 
may have preferential activity toward longer or shorter 
chain fatty acyl substrates. Plant thioesterases having 
long-chain preferring fatty acyl hydrolysis activity or 
5 medium- chain preferring fatty acyl hydrolysis activity are 
both considered homologously related proteins hereunder, 
for reasons as described in more detail further below. 

Again/ not only can sequences such as shown in Fig. 4 
and Table 9 be used to identify homologous plant 
10 thioesterases, but the resulting sequences obtained 

therefrom may also provide a further method to obtain plant 
thioesterases from other plant sources. In particular, PGR 
may be a useful technique to obtain related plant 
thioesterases from sequence data provided herein. One 
15 skilled in the art will be able to design oligonucleotide 
probes based upon sequence comparisons or regions of 
typically highly conserved sequence. Of special interest 
are probes based upon the S828 or the S829 fragment of 
Table 9. Details relating to the design and methods for a 
20 PGR reaction using these probes is described more fully in 
the examples. 

It should also be noted that plant thioesterases of a 
variety of sources can be used to investigate chain 
f termination events of plant fatty acid biosynthesis in a 

25 wide variety of plant and in vivo applications. Because 
all plants appear to synthesize fatty acids via a common 
metabolic pathway, the study and/or application of one 
plant thioesterase to a heterologous plant host may be 
readily achieved in a variety of species. In other 



applications, a plant thioesterase can be used in 
conjunction with plastid lysates outside the native plant 
source of the thioesterase to enhance the production and/or 
modify the composition of the fatty acids prepared in 
5 vitro. 

Once the nucleic acid sequence is obtained, the 
transcription, or transcription and translation 
(expression) , of the plant thioesterase in a host cell is 
desired to produce a ready source of the enzyme and/or 

10 modify the composition of fatty acids and/or triglycerides 
found therein. Other useful applications may be found when 
the host cell is a plant host cell, in vitro and in vivo. 

For example, by increasing the amount of a respective 
shorter-chain preferring thioesterase available to the 

15 plant FAS complex, an increased percentage of shorter chain 
fatty acids may be provided. In a like manner, for some 
applications it may be desired to decrease the amount of 
shorter-chain preferring acyl-ACP thioesterase endogenously 
expressed in a plant cell by anti-sense technology, for 

20 example, to increase the percentage of longer chain fatty 
acids, and visa versa. See, co-pending U,S. patent 
application No. 240,408 filed 8/30/88. The greater 
specificity of the plant thioesterase toward a given fatty- 
acyl substrate, the more control it may be possible to 

25 exert in the FAS system. 

Medium-Chain Preferring Plant Thioesterases 

By this invention, a mechanism for the biosynthesis of 
medium-chain fatty acids in plants is demonstrated. 
Namely, that specific plant thioesterases having 



15 



preferential activity toward medium-chain acyl substrates 
are involved in the accumulation of medium chain fatty 
acids in at least some plant species . 

The determination that chain-length specific plant 
thioesterases are active in the in vivo production of 
medium-chain fatty acids suggests several possibilities for 
enzyme plant sources. And in fact, medium-chain fatty 
acids are found in some natural plant species in abundance. 
For example, several species in the genus Cvphea accumulate 
triglycerides containing medium-chain fatty acids in their 
seeds, e.g., procujnbens, lutea^. hooJceriana, hyssopifoliar 
wrightii and in f lata. Another natural plant source of 
medium-chain fatty acids are seeds of the Lauraceae family: 
e.g., the California Bay (Umbellularia californica) , Pisa 
(Actinodophne hookeri) , Sweet Bay (Laurus nobilis) and 
Cinnamomum camphora (camphor) . Other plant sources include 
Ulmaceae (elm) , Myristicaceaer Simarubaceae, Vochysiaceae, 
and Salvadoraceae . 

Plants having significant presence of medium-chain 
fatty acids therein are preferred candidates to obtain 
naturally-derived medium-chain preferring plant 
thioesterases. However, it will also be recognized that 
other plant sources which do not have a significant 
presence of medium-chain fatty acids may be readily 
screened as other enzyme sources. In addition, a 
comparison between endogenous medium-chain preferring plant 
thioesterases and between longer and/or shorter chain 
preferring plant thioesterases may yield insights for 
protein modeling or other modifications to create synthetic 



) 

medium-chain preferring plant tiiioesterases as discussed 
above , 

Especially of interest are medium-chain preferring 
plant thioesterases which demonstrate preferential 
5 hydrolysis activity toward fatty acyl-ACP substrates. Most 
preferred are medium-chain preferring plant thioesterases 
which demonstrate a marked preference toward C12 acyl-ACP, 
CIO acyl-ACP or C8 acyl-ACP substrates. As described 
above, other plant sources may also provide sources for 

10 these enzymes through the use of protein purification, 
nucleic acid probes, antibody preparations, protein 
modeling, or sequence comparisons, for example, and of 
special interest are the respective amino acid and nucleic 
acid secjpiences corresponding to such plant thioesterases. 

15 Also as previously described, once nucleic acid sequence is 
obtained for the given plant thioesterase, further plant 
sequences may be compared and/or probed to obtain 
homologously related DNA sequences thereto and so on. 

Medium-chain preferring acyl-ACP .plant thioesterases 

20 have been partially purified from immature embryos of the 
California Bay (Ombellularia calif ornica) tree, hereinafter 
sometimes referred to as "Bay, " and Cuphea hookeriana, 
hereinafter sometimes referred to as ''Cuphea." The Bay 
thioesterase enzyme activity consistently co-migrates in 

25 chromatographic and electrophoretic separations with a 
protein or pair of proteins having an apparent Mr of 
approximately 34 kD, A native molecular weight of 
approximately 42 kD has been estimated by gel filtration 
chromatography suggesting that the enzyme is a. monomer of 



medium-chain preferring plant thioesterases as discussed 
above. 

Especially of interest are medium-chain preferring 
plant thioesterases which demonstrate preferential 
5 hydrolysis activity toward fatty acyl-ACP substrates. Most 
preferred are medium-chain preferring plant thioesterases 
which demonstrate a marked preference toward C12 acyl-ACP, 
CIO acyl-ACP or C8 acyl-ACP substrates. As described 
above, other plant sources may also provide sources for 

10 these enzymes through the use of protein purification, 
nucleic acid probes, antibody preparations, protein 
modeling, or sequence comparisons, for example, and of 
special interest are the respective amino acid and nucleic 
acid sequences corresponding to such plant thioesterases. 

15 Also as previously described, once nucleic acid sequence is 
obtained for the given plant thioesterase, further plant 
sequences may be compared and/or probed to obtain 
homologously related DNA sequences thereto and so on. 

Medium-chain preferring acyl-ACP plant thioesterases 

20 have been partially purified from immature embryos of the 
California Bay (Umbellularia callfornica) tree, hereinafter 
sometimes referred to as "Bay, " and Cuphea hookeriana, 
hereinafter sometimes referred to as "Cuphea." The Bay 
thioesterase enzyme activity consistently co-migrates in 

25 chromatographic and electrophoretic separations with a 
protein or pair of proteins having an apparent Mr of 
approximately 34 kD. A native molecular weight of 
approximately 42 kD has been estimated by gel filtration 
chromatography suggesting that the enzyme is a. monomer of 
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the 34 kD subunit. Affinity chromatography on immobilized 
ACP forms a critical step in the purification procedure, 
and resolves the.l2:0-ACP and 18:1-ACP thioesterases 
sufficiently to confirm that the medium-chain enzyme has 
5 negligible action on 18:1-ACP. The time-course of 

induction of 12:0-ACP thioesterase during seed development 
shows that the fatty acyl composition of the cotyledons 
changes abruptly from long-chain acyl groups to 
predominantly CIO and C12 at the earliest appearance of 
10 enzyme activity. 

As demonstrated more fully in the Examples, a plant 
thioesterase preparation having preferential hydrolase 
activity toward medium-chain fatty acyl-ACP substrates of 
California Bay substantially free of other plant proteins 
15 is Obtainable as follows: Briefly, a supernatant fraction 
of ground California Bay immature embryos is subjected to 
ammonium sulfate fractionation, followed by hydroxyapatite 
column chromatography of the redissolved pellet, applying 
carboxymethyl Sepharose chromatography to the eluted 
20 fractions, and further chromatography on a column of 
immobilized E. coli ACP . One or two proteins having an 
approximate molecular weight of 34 kD co-elutes or co-. 
migrate with the enzyme activity through a variety of 
chromatographic or electrophoretic techniques. These 
25 proteins correspond to the medium-chain thioesterase. (See 
also, Pollard et al., Archiv. Biochem Biophys, (1991) 
284:306-312, which is hereby incorporated by reference.) 

Also described in the Examples, are methods to obtain 
a partially purified Cuphea ClO-preferring acyl-ACP 
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thioesterase . The Cuphea thioesterase is partially 
purified from other plant proteins and activity is 
confirmed in the same general manner as the Bay 
thioesterase. As described more fully in the Examples, the 
5 various buffers and techniques may be different than those 
used in the Bay extraction. Enzyme activity is compared 
against various acyl-ACP substrates and demonstrates 
significantly more activity toward CIO acyl-ACP substrates 
as compared with other medium chain acyl-ACP substrates. 

10 Although the resulting Cuphea preparation also 

demonstrates activity against longer-chain substrate in 
addition to medium-chain fatty acyl-ACP sxibstrate^ both 
above described Bay and Cuphea preparations are considered 
to be "substantially free from other plant proteins" in 

15 that they demonstrate a recognizably distinct preferential 
activity toward medium-chain fatty acyl-ACP substrates. 
The resulting partially purified preparation (s) may be 
characterized by various parameters, including but not 
limited to comparative inhibitor studies and substrate 

20 specificity. 

As for both the Cuphea and Bay preparations, 
additional and/or alternative purification steps may be 
desired to purify the protein extract to homogeneity, to 
increase yield or the like. Moreover, now that the 

25 existence of these proteins is confirmed and various 

properties described, alternative purification protocols 
and/or additional purification steps are within the 
capabilities of one skilled in the art. 
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Also by this invention, sequence information regarding 
a long-chain thioesterase obtained from saff lower 
(Cartiiamus tinctorius) is described. Interestingly, it has 
5 been discovered that at least two of the peptide fragments 
sequenced from the safflower thioesterase show high 
sequence identity with portions of the Bay medium-chain 
preferring thioesterase . 

Although described in more detail in the Examples, the 
10 safflower thioesterase peptide fragments were obtained upon 
subjecting acetone ground safflower embryos to an acid 
precipitation followed by chromatography on an ACP column 
and a chromatofocusing column. Through analysis of enzyme 
activity peaks as compared with the proteins obtained from 
15 the ACP column, two proteins, one at approximately 34 kD 
and one at approximately 40 kD were selected for further 
analysis . Fragments sequenced after cyanogen bromide 
blotting are shown in Table 9 of Example 14. 

In particular, it was found that every fragment 
2.0 sequenced corresponding to the 34 kD band was detected in 
the 40 kD band. In addition, it appears that the 34 kD 
product shares the same N-^terminus as the 40 kD products A 
schematic representation postulating the positioning of 
various fragments from Table 9 is found in Fig, 5. In 
25 addition, it was found that segments of the Bay 

thioesterase amino acid secpaence demonstrated high sequence 
identity with at least two of the sequenced fragments, S828 
and S829, 
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Genetic Engineering Applications 

As is well known in the art, once a plant thioest erase 
is obtained, it may be used to obtain its corresponding 
amino acid and/or nucleic acid sequences thereto. As a 
5 representative example, the amino acid sequence may be 
obtained by the sequencing of peptide fragments resulting 
from partial protease digestion of protein blots recovered 
from a gel. For sequencing, the use of a two-dimensional 
gel may be desired over a one dimensional SDS-PAGE gel. 

10 The peptide fragments may be used to deduce amino acid 
sequences and eventually, amino acid sequence may be 
obtained. From the amino acid sequence, the information 
can be reverse translated and, nucleic acid probes can be. 
synthesized for use in PGR process or for use as probes in 

15 the recovery of the gene. As yet a different example, the 
purified protein may be used to raise antibodies thereto. 
The antibodies, polyclonal or monoclonal, may also be used 
to isolate other immunologically related plant thioesterase 
genes. Alternative methods will also be apparent in 

20 accordance with methods familiar to those skilled in the 
art. 

The nucleic acid sequences which encode plant 
thioesterases may be used in various constructs, for 
example, as probes to obtain further sequences. 
25 Alternatively, these sequences may be used in conjunction 
with appropriate regulatory sequences to increase levels of 
the respective thioesterase of interest in a host cell for 
recovery or study of the enzyme in vitro or in vivo or to 
decrease levels of the respective thioesterase of interest 
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for some applications when the host cell is a plant entity, 
including plant cells, plant parts (including but not 
limited to seeds, cuttings or tissues) and plants. 

A nucleic acid sequence encoding a plant thioesterase 
of this invention may include genomic, cDNA or mRNA 
sequence. By "encoding" is meant that the sequence 
corresponds to a particular amino acid sequence either in a 
sense or anti-sense orientation. By "extrachromosomal" is 
meant that the sequence is outside of the plant genome of - 
which it is naturally associated. By "recombinant" is 
meant that the sequence contains a genetically engineered 
modification through manipulation via mutagenesis, 
restriction enzymes, and the like. A cDNA sequence may or 
may not contain pre-processing sequences, such as transit 
peptide sequences. Transit peptide sequences facilitate 
the delivery of the protein to a given organelle and are 
cleaved from the amino acid moiety upon entry into the 
organelle, releasing the "mature" sequence. The use of the 
precursor DNA sequence is preferred in plant cell 
expression cassettes. Other plastid transit peptide 
sequences, such as a transit peptide of seed ACP, may be 
employed to translocate the plant thioesterase of this 
invention to various organelles of interest. See, U.S. 
Serial No. 07/437,764, filed 11/15/89 and European Patent 
Application Publication No. 189,707. In a like manner, 
once a given plant thioesterase transit peptide is 
obtained, it may be used to translocate sequences other 
than its native coding region. 
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Furthermorer as discussed above the complete genomic 
sequence of the plant thioesterase may be obtained by the 
screening of a genomic library with a probe, such as a cDNA 
probe, and isolating those sequences which regulate 
5 expression in seed tissue. In this manner, the 

transcription and translation initiation regions, introns, 
and/or transcript termination regions of the plant 
thioesterase may be obtained for use in a variety of DNA 
constructs, with or without the thioesterase structural 

10 gene. Thus, nucleic acid sequences corresponding to the 
plant thioesterase of this invention may also provide 
signal sequences useful to direct transport into a plastid, 
5* upstream non-coding regulatory regions (promoters) 
having useful tissue and timing profiles, 3' downstream 

15 non-coding regulatory region useful as transcriptional and 
translational regulatory regions and may lend insight into 
other features of the gene. 

Once the desired plant thioesterase nucleic acid 
sequence is obtained, it may be manipulated in a variety of 

20 ways. Where the sequence Involves non-coding flanking 
regions, the flanking regions may be subjected to 
resection, mutagenesis, etc. Thus, transitions, 
trans vers ions, deletions, and insertions may be performed 
on the naturally occurring sequence. In addition, all or 

25 part of the sequence may be synthesized. In the structural 
gene, one or more codons may be modified to provide for a 
modified amino acid sequence, or one or more codon 
mutations may be introduced to provide for a convenient 
restriction site or other purpose involved with 
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construction or expression. The structural gene may be 
further modified by employing synthetic adapters, linkers 
to introduce one or more convenient restriction sites, or 
the like. 

5 The nucleic acid or amino acid sequences encoding a 

plant thioesterase of this Invention may be combined with 
other non-native, or "heterologous", sequences in a variety 
of ways. By "heterologous" sequences is meant any sequence 
which is not naturally found joined to the plant 
10 thioesterase, including, for example, combinations of 

nucleic acid sequences from the same plant which are not 
naturally found joined together. 

The DNA sequence encoding a plant thioesterase of this 
invention may be employed in conjunction with all or part 
15 of the gene sequences normally associated with the 

thioesterase. In its component parts, a DNA sequence 
encoding thioesterase is combined in a DNA construct 
having, in the 5' to 3' direction of transcription, a 
transcription initiation control region capable of 
20 promoting transcription and translation in a host cell, the 
DNA sequence encoding plant thioesterase and a 
transcription and translation termination region. 

Potential host cells include both prokaryotic and 
eukaryotic cells. A host cell may be unicellular or found 
25 in a multicellar differentiated or undifferentiated 

organism depending upon the intended use. Cells of this 
invention may be distinguished by having a plant 
thioesterase foreign to the wild-type cell present therein, 



24 



l'Ll/US91/02960 



for example, by having a recombinant nucleic acid construct 
encoding a plant thioesterase therein. 

Depending upon the host, the regulatory regions will 
vary, including regions from viral, plasmid or chromosomal 
5 genes, or the like. For expression in prokaryotic or 

eukaryotic microorganisms, particularly unicellular hosts, 
a wide variety of constitutive or regulatable promoters may 
be employed. Expression in a microorganism can provide a 
ready source of the plant enzyme, ' Among transcriptional 
10 initiation regions which have been described are regions 
from bacterial and yeast hosts, such as E. coli, B. 
suhtilis, Sacchromyces cerevisiaer including genes such as, 
beta-galactosidase, T7 polymerase, tryptophan E and the 
like . 

15 For the most part, the constructs will involve 

regulatory regions functional in plants which provide for 
modified production of plant thioesterase, and possibly, 
modification of the fatty acid composition. The open 
reading frame, coding for the plant thioesterase or 

20 functional fragment thereof will be joined at its 5' end to 
a transcription initiation regulatory region such as the 
wild-type sequence naturally found 5' upstream to the 
thioesterase structural gene. Numerous other transcription 
initiation regions are available which provide for a wide 

25 variety of constitutive or regulatable, e.g., inducible, 
transcription of the structural gene functions. Among 
transcriptional initiation regions used for plants are such 
regions associated with the structural genes such as for 
nopaline and mannopine synthases, or with napin, ACP 
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promoters and the like. The transcription/translation 
initiation regions corresponding to such structural genes 
are found immediately 5' upstream to the respective start 
codons. In embodiments wherein the expression of the 
thioesterase protein is desired in a plant host, the use of 
all or part of the complete plant thioesterase gene is 
desired; namely all or part of the 5' upstream non-coding 
regions (promoter) together with the structural gene ^ 
sequence and 3' downstream non-coding regions may be 
employed- If a different promoter is desired, such as a 
promoter native to the plant host of interest or a modified 
promoter, i.e., having transcription initiation regions 
derived from one gene source and translation initiation 
regions derived from a different gene source, including the 
sequence encoding the plant thioesterase of interest, or 
enhanced promoters, such as double 35S CaMV promoters, the 
sequences may be joined together using standard techniques. 

For such applications when 5' upstream non-coding 
regions are obtained from other genes regulated during seed 
maturation, those preferentially expressed in plant embryo 
tissue, such as AGP and napin-derived transcription 
initiation control regions, are desired. Such "seed- 
specific promoters" may be obtained and used in accordance 
with the teachings of U.S. Serial No. 07/147,781, filed 
1/25/88 (now U.S. Serial No. 07/550,804, filed 7/9/90), and 
U.S. Serial No. 07/494,722 filed on or about March 16, 1990 
having a title "Novel Sequences Preferentially Expressed In 
Early Seed Development and Methods Related Thereto," which 
references are hereby incorporated by reference. 
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Transcription initiation regions which are preferentially 
expressed in seed tissue, i.e., which are undetectable in 
other plant parts, are considered desirable for fatty acid 
modifications in order to minimize any disruptive or 
5 adverse effects of the gene product. 

Regulatory transcript termination regions may be 
provided in DNA constructs of this invention as well. 
Transcript termination regions may be provided by the DNA 
sequence encoding the plant thioesterase or a convenient 

10 transcription termination region derived from a different 
gene source, for example, the transcript termination region 
which is naturally associated with the transcript 
initiation region. Where the transcript termination region 
is from a different gene source, it will contain at least 

15 about 0.5kb, preferably about l-3kb of sequence 3' to the 
structural gene from which the termination region is 
derived. 

Plant expression or transcription constructs having a 
plant thioesterase as the DNA sequence of interest for 

20 increased or decreased expression thereof may be employed 
with a wide variety of plant life, particularly, plant life 
involved in the production of vegetable oils for edible and 
industrial uses. Most especially preferred are temperate 
oilseed crops. Plants of interest include, but are not 

25 limited to, rapeseed (Canola and High Erucic Acid 

varieties), sunflower, safflower, cotton, Cuphea, soybean, 
peanut, coconut and oil palms, and corn. Depending on the 
method for introducing the recombinant constructs into the 
host cell, other DNA sequences may be required. 
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Importantly, this invention is applicable to dicotyledyons 
and monocotyledons species alike and will be readily 
applicable to new and/or improved transformation and 
regulation techniques . 
5 The method of transformation is not critical to the 

instant invention; various methods of plant transformation 
are currently available. As newer methods are available to 
transform crops, they may be directly applied hereunder. 
For example, many, plant species naturally susceptible to 
10 Agrojbacterium infection may be successfully transformed via 
tripartite or binary vector methods of Agrobacterium 
mediated, transformation. In addition, techniques of 
microinjection, DNA particle bombardment^ electroporation 
have been developed which allow for the transformation of 
15 various monocot and dicot plant species- 

In developing the DNA construct, the various 
components of the construct or fragments thereof will 
normally be inserted into a convenient cloning vector which 
is capable of replication in a bacterial host,, e.g., £. 
20 coli. Numerous vectors exist that have been described in 
the literature. After each cloning, the plasmid may be 
isolated and subjected to further manipulation, such as 
restriction, insertion of new fragments, ligation, 
deletion, insertion, resection, etc., so as to tailor the 
25 components of the desired sequence. Once the construct has 
been completed, it may then be transferred to an 
appropriate vector for further manipulation in accordance 
with the manner of transformation of the host cell. 
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Normally, included with the DNA construct will be a 
structural gene having the necessary regulatory regions for 
expression in a host and providing for selection of 
transformant cells. The gene may provide for resistance to 
5 a cytotoxic agent, e.g. antibiotic, heavy metal, toxin, 
etc . , complementation providing prototrophy to an 
auxotrophic host, viral immunity or the like. Depending 
upon the number of different host species the expression 
construct or components thereof are introduced, one or more 

10 markers may be employed, where different conditions for 
selection are used for the different hosts. 

It is noted that the degeneracy of the DNA code 
provides that some codon substitutions are permissible of 
DNA sequences without any corresponding modification of the 

15 amino acid sequence. When any non-plant derived DNA 

sequence is to be expressed in a plant host cell, the use 
of "plant preferred codons" is desirable. 

As mentioned above, the manner in which the DNA 
construct is introduced into the plant host is not critical 

20 to this invention. Any method which provides for efficient 
transformation may be employed. Various methods for plant 
cell transformation include the use of Ti- or Ri-plasmids, 
microinjection, electroporation, DNA particle bombardment, 
liposome fusion, DNA bombardment or the like. In many 

25 instances, it will be desirable to have the construct 

bordered on one or both sides by T-DNA, particularly having 
the left and right borders, more particularly the right 
border. This is particularly useful when the construct 
uses A. tumefaciens ox A. rhlzogenes as a mode for 
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transformation, although the T-DNA borders may find use 
with other modes of transformation. 

Where Agrobacterium is used for plant cell 
transformation, a vector may be used which may be 
5 introduced into the Agrobacterium host for homologous 

recombination with T-DNA or the Ti- or Ri-plasmid present 
in the Agrobacterium host. The Ti- or Ri-plasmid 
containing the T-DNA for recombination may be armed 
(capable of causing gall formation) or disarmed (incapable 

10 of causing gall formation) , the latter being permissible, 
so long as the vir genes are present in the transformed 
AgrroJbacteriuiT? host* The armed plasn^id can give a mixture 
of normal plant cells and gall. 

In some instances where Agrobacterium is used as the 

15 vehicle for transforming plant cells, the expression 

construct bordered by the T-DNA border (s) will be inserted 
into a broad host spectrum vector, there being broad host 
spectrum vectors described in the literature. Commonly 
used is pRK2 or derivatives thereof. See, for example, 

20 Ditta et ai., PNAS USA, (1980) 77:7347-7351 and EPA 0 120 
515,, which are incorporated herein by reference. Included 
with the expression construct and the T-DNA will be one or 
more markers, which allow for selection of transformed 
AgroJbacterluin and transformed plant cells. A number of 

25 markers have been developed for use with plant cells, such 
as resistance to chloramphenicol, the aminoglycoside 6418, 
hygromycin, or the like. The particular marker employed is 
not essential to this invention, one or another marker 
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being preferred depending on the particular host and the 
manner of construction. 

For transformation of plant cells using Agrroi)acteriu/n, 
explants may be combined and incubated with the transformed 
AgrroJbacteriujn for sufficient time for transformation,, the 
bacteria killed, and the plant cells cultured in an 
appropriate selective medium. Once callus forms, shoot 
formation can be encouraged by employing the appropriate 
plant hormones in accordance with known methods and the 
shoots transferred to rooting medium for regeneration of 
plants. The plants may then be grown to seed and the seed 
used to establish repetitive generations and for isolation 
of vegetable oils. 

The invention now being generally described, it will 
be more readily understood by reference to the following 
examples which are included for purposes of illustration 
only and are not intended to limit the present invention. 

EXAMPLES 

In the experimental disclosure which follows, all 
temperatures are given in degrees centigrade C) , weights 
are given in grams (g) , milligram (mg) or micrograms (lig) , 
concentrations are given as molar (M) , millimolar (mM) or 
micromolar (^IM) and all volumes are given in liters (1) , 
microliters (p.1) or milliliters (ml), unless otherwise 
indicated. 

Example 1 - C12-Pref erring Aayl-ACP Thioesterase Assay 

To assay for C12 thioesterase activity the following 
mixture is incubated at 30® for 30 min: "buffer" 
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comprising 7inM KH2PO4-KOH pH 8, 20% v/v glycerol, ImM 
dithiothreitol (DTT) , 0.1% v/v Triton XlOO; sample to be 
tested for activity in the same or similar buffer as the 
"extraction buffer" described in Example 2; and 5\il of ^^C- 
5 radiolabeled lauroyl-ACP substrate for a total volume of 
lOO^ll and final lauroyl-ACP concentration of O.S^M. 
Lauroyl-ACP substrate is prepared according to the method 
of Rock et al (Methods in Enzymology (1981) 72:397-403), 
using ACP prepared from Escherichia coli by the method of 
10 Rock and Cronan {Methods in Enzymology (1981) 71:341-351). 
The laurate is radiolabeled in the carboxylate group at a 
specific radioactivity of 50-60|iCi/^mol . 

The reaction is stopped by adding 0.5ml cold (0°) 10% 
v/v acetic acid. The fatty acid product of the hydrolytic 
15 enzyme action is extracted away from the unhydrolyzed 
substrate by adding 1ml diethyl ether and mixing 
vigorously. After settling for a few minutes the upper 
ether layer is transferred to 5ml scintillation fluid for 
determination of radioactivity by liquid scintillation 
20 spectrometry. Additional ether extractions may be 
performed to recover remaining traces of the reaction 
product for more accurate quantitation of the enzyme 
activity if desired. The amount of ether-extracted 
radioactivity is a direct measure of C12-preferring acyl- 
25 ACP thioesterase activity, provided the amount of enzyme is 
not sufficient to hydrolyze more than about 25% of the 
substrate. With greater activity than this the 
relationship between radioactivity in the ether layer and 
the quantity of enzyme becomes markedly nonlinear. The 
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enzyme preparation must then be diluted appropriately to 
bring the activity into the linear range of the assay. 

The activity is confirmed to be thioesterase by 
analysis of the ether-soluble product using thin-layer 
chromatography (TLC) . The product co-migrates with 
authentic laurate on a silica TLC plate (solvent: 80% 
hexane, 20% diethyl ether, 1% acetic acid v/v) . If 
phenacyl esters are prepared (Borch, R.F., Analytical 
Chemistry (1975) 47:2437-2439) using the ether product- 
containing layer from the assay procedure, the resulting 
radioactive spot co-migrates with authentic lauroyl 
phenacyl ester on a C18 TLC plate (solvent: 100% methanol), 
as does the product of base hydrolysis of the lauroyl-ACP 
substrate. These observations verify that the ether- 
extracted product of the enzyme reaction is free laurate. 
It is also deduced that the enzyme of interest hydrolyzes 
the thioester bond, e.g. it cannot be a protease attacking 
the ACP moiety of the substrate or the product would be 
lauroyl-phosphopantetheine which would have migrated 
differently on TLC. 

Example 2 - Bay Thioesterase Purification & Identification 

Immature seeds of Vnihellularia californica ("Bay") are 
harvested at the stage at which decanoate and laurate 
predominate in the fatty acid composition as determined by 
total fatty acid analysis of the cotyledons. The 
cotyledons from such seeds are dissected from the other 
seed parts and stored frozen at -70°. This comprises the 
source tissue for enzyme extraction • 
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The frozen cotyledons are powdered in a stainless 
steel mortar and pestle at approximately -70°, and the 
powder is stored under liquid nitrogen or at -70° until 
required. Extraction is accomplished by adding, at 0-4®, 
to the powder 4ml/g of "extraction buffer" comprising 50mM 
KH2PO4-KOH pH 6.9, 5mM ethylenediamine tetraacetate (EDTA) , 
2mM DTT, ImM sodium ascorbate, ImM phenylmethylsulfonyl 
fluoride, l\m leupeptin, and l]m pepstatin. The stirred 
mixture of powder and buffer is blended in a motorized 
macerator (Brinkmann (Westbury, NY) ^'Polytron'', three . 
bursts of 45 sec each) and then filtered through four 
layers of cheesecloth. This and all subsequent steps are 
conducted at 0-4**. The resulting filtrate is centrifuged 
at approximately 14,000xg (max.) for 30 min. The 
supernatant fractions are filtered through '"Miracloth" 
(Calbiochem. Corp., LaJolla, CA) and the resulting liquid 
is referred to as the ''crude extract". 

The crude extract is subjected to ammonium sulfate 
fractionation as follows. Sufficient solid ammonium sulfate 
is gradually added with stirring over 30 min to achieve 70% 
saturation. The preparation is then stirred for a further 
30 min. After centrifuging as described above, the 
pelleted material is resuspended in extraction buffer 
(2ml/g original tissue weight) and stirred for 10 min until 
dissolved. Ammonium sulfate is then added as before, but 
this time only sufficient to achieve 50% saturation. After 
centrifuging as before, the supernatant fraction is 
discarded. The pelleted material, which contains the C12- 
preferring acyl-ACP thioesterase, may be frozen by 
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imraersion in liquid nitrogen and then stored at -70® at 
this stage if desired. The resulting material is referred 
to as the "ainiaonium sulfate fraction." Very little of the 
C12-preferring acyl-ACP thioesterase activity is lost if 
5 the pellet is frozen very rapidly. 

After thawing to 4** if necessary, the pellet material 
is resuspended in "HAl buffer" (Iml/g original tissue 
weight), comprising 50raM KH2PO4-KOH pH 6.9, 25% w/v 
glycerol, ImM DTT. The resuspended preparation is placed 

10 in dialysis tubing (12,000-14,000 molecular weight cutoff) 
and set to dialyze against HAl buffer. (Typically a 
preparation from 600g of cotyledon tissue will require two 
successive dialysis steps against 4 liters of buffer each, 
for at least three hours each.) Before application to the 

15 first column, the dialyzed material is centrifuged as 
. described above and the pelleted material is discarded. 
The supernatant material from post-dialysis 
centrifugation is applied to a column of hydroxyapatite 
(HA-Dltrogel from IBF Biotechnics, catalog, no. 247741, 

20 Savage, MD; for a preparation from 500-120 Og of tissue 

typically 10cm diameter x 12.5cm bed height), equilibrated 
in HAl buffer. The column is then washed with HAl buffer 
until the absorbance of the effluent at 280nm no longer 
changes. A- considerable amount of protein and sometimes a 

25 small amount of the C12-preferring acyl-ACP thioesterase 
activity fail to bind the column and are washed through it. 
The bulk of the C12-preferring acyl-ACP thioesterase 
activity binds, and is eluted by applying "HA2 buffer" 
comprising 400mM KH2PO4-KOH pH 6.9, 25% w/v glycerol, ImM 
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DTT. The effluent is collected in fractions (5-10 ml in 
volume), which are then assayed for C12-preferring acyl-ACP 
thioesterase activity. The active fractions are combined 
and dialyzed as- described above, against "CMl puffer" 
5 comprising 5mM KH2PO4-KOH pH 6,5, 25% w/v glycerol, ImM 

EDTA, ImM DTT (typically three dialysis periods of at least 
3 hr each against 4 liters each) . After dialysis the 
material is clarified by centrifugation as described 
previously, the pellets being discarded. 

10 The supernatant fraction is then applied to a cation 

exchange column (Pharmacia CM-Sepharose Fast Flow, 
Piscataway, NJ, catalog no- 17-0719-01, 10cm diameter x 
14cm bed height) equilibrated with CMl buffer. After 
loading, the column is washed with CMl buffer until the 

15 absorbance of the effluent stream at 280nm no longer 
changes. A considerable quantity of protein and a 
significant amount {e.g. 50%) of the Cl2-pref erring acyl- 
ACP thioesterase activity fail to bind the column and are 
washed through it. This partial binding of the C12- 

20 preferring acyl-ACP thioesterase has been investigated and 
found to result from aggregation of this enzyme with other, 
unidentified proteins at the time of extraction. In effect 
there are two populations of the C12-preferring acyl-ACP 
thioesterase up to this point in the purification scheme, 

25 free enzyme and aggregate. The cation exchange column 
separates these two forms and the aggregate is discarded. 
The unaggregated form of the C12 acyl-ACP thioesterase is 
eluted from the column by applying "CM2 buffer" comprising 
50mM KH2PO4-KOH pH 6.9, 150mM NaCl, 25% w/v glycerol, ImM 
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EDTA, IrnM DTT. The effluent stream is fractionated and 
assayed as before, and the active fractions are pooled and 
dialyzed against "ACPI buffer" comprising lOmM KH2PO4-KOH 
pH 6.5, 150mM NaCl, 25% w/v glycerol, ImM EDTA, ImM DTT, 
5 0.1% w/v 3- [ (3-cholamidopropyl) -dimethylammonio] -1- 
propanesulfonate (CHAPS) . Typically two successive 
dialyses for at least 3 hr each, against 4 liters each, 
suffice for a preparation from 600g tissue. 

The dialyzed material is then applied to a column of 

10 immobilized ACP (2.5cm diameter x 10.5cm bed height) . This 
column is manufactured by coupling Escherichia coli AC? to 
cyanogen bromide-activated Sepharose 4B according to 
instructions supplied by the manufacturer of this column . 
packing (Pharmacia Inc., Piscataway, NJ) . The E. coli ACP 

15 is prepared as referenced in Example 1. Before use the 
column is equilibrated with ACPI buffer • The dialyzed 
material from the cation exchange column is applied at 1- 
1.3ml/min, and fractions of 8ml volume are collected 
throughout the procedure. Fractions are assayed for C12- 

20 preferring acyl-ACP thioesterase activity, and for total 
protein . content using a Coomassie Blue assay method (Bio- 
Rad Inc., Richmond, CA, catalog no. 500-0001). A 
substantial amount of protein washes through the column 
without binding. Almost all of the C12-pref erring acyl-ACP 

25 thioesterase activity binds. The column is washed with 

ACPI buffer until the protein assay detects no more protein 
in the effluent stream. It is then washed with "ACP2 
buffer" comprising 50mM KH2PO4-KOH pH 8.5, 50mM glycine, 
25% w/v glycerol, ImM EDTA, ImM DTT, 0.1% w/v CHAPS. This 
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high pH wash serves to remove nonspecif ically bound 
protein. A small amount of C12 acyl-ACP thioesterase 
activity is occasionally co-eluted with it. After the 
protein assay has again indicated that no more protein is 
5 being eluted, a linear "elation gradient" is applied. This 
comprises 560ml combined volume of "ACP3 buffer" (lOOmM 
KH2P04~KOH pH 6.9, 25% w/v glycerol, ImM EDTA, ImM DTT, 
0.1% w/v CHAPS) and "ACP4 buffer" (SOOmM KH2PO4-KOH pH 6.9, 
25% w/v glycerol, ImM EDTA, ImM DTT, 0.1% w/v CHAPS) . If 
10 C12-preferring acyl--ACP thioesterase activity is still 

eluting from the column when the gradient ends, its elution 
can be completed by applying more ACP4 buffer. The 
collected fractions are assayed as before, and a second 
C12-preferring acyl-ACP thioesterase assay is also 
15 performed with the fractions diluted fifty-fold. By 

compensating for nonlinearity of the assay this gives a 
more precise location of the maximum enzyme activity. The 
C12-preferring acyl-ACP thioesterase activity is typically 
present in the gradient-eluted fractions as two peaks, a 
20 smaller one eluting just before a much larger one. 
The fractions comprising each peak are pooled 
separately. The larger, later eluting peak is the most 
pure material that is used for subsequent experiments, 
protein sequencing etc. Analysis of this material by 
25 typical SDS-PAGE procedures shows only 5-6 strongly 

staining bands including a band of an approximate molecular 
weight at 34 kD and a few weakly staining ones. 

Aliquots of fractions from the ACP column are analyzed 
by SDS polyacrylamide gel electrophoresis (SDS-PAGE) and 



silver staining. The band pattern at the peak of eluted 
activity is markedly simplified relative to the flow-' 
through and pH 8,5-eluted material. Band patterns are 
compared from fraction to fraction 'to identify bands whose 
5 intensities increase and decrease in concert with 12:0-ACP 
thioesterase activity. One band pattern corresponding to 
an approximate molecular weight of 34 kD, satisfied this 
criterion. In some preparations a closely spaced doublet 
is seen at this position on the SDS gel. 

10 Alternatively/^ a variety, of chromatographic and 

electrophoretic techniques may be applied to the 
substantially purified 12:0-ACP thioesterase pool from the 
ACP column, including ion-exchange chromatography, 
immobilized dye chromatography, and native gel 

15 electrophoresis • None of them purifies the enzyme to 

electrophoretic homogeneity. However, in all cases a band 
or pair of bands of approximate molecular weight 34 kD co- 
elutes or co-migrates with the enzyme activity. The best 
resolution is obtained by chromatography on S-Sepharose 

20 followed by Blue 4 agarose, with the most informative 

separation occurring on the final Blue 4 agarose column. 
The most abundant eluted proteins are those of approximate 
molecular weight 65 kD, 39 kD, and 34 kD (doublet) . Only 
the 34 kD pair elutes in synchrony with the peak of 12:0- 

25 ACP thioesterase activity. 

Example - C12-'preferring acyl-ACP Thioesterase Inhibitor 
Studies 

Table 1 below reports inhibition of Bay cotyledon C12- 
30 preferring acyl-ACP thioesterase by thiol reagents observed 



39 



rcr/i;59iAO29O0 



when an ammonium sulfate fraction (see, Example 2) was 
assayed (see, Example 1) . 



Addition 
to Assay 



Mean 
Activity* 



Percent 
Inhibition 



10 



15 



None (control 
0.5mM iodoacetamide 
5mM 

0*5mM N-ethylmaleimide 
5mM 



4322 
4180 
4047 
4320 
103 



3 
6 
0 
98 



20 



*"Mean Activity" is a measurement of the mean score of 
duplicates provided in cpm as observed in the ether layer 
of Example 1 

After removal of dithiothreitol from an ammonium 
sulfate fraction preparation by passage through a small 
column of G25-50 gel filtration medium (Pharmacia, 
Piscataway, NJ) the following assay results were observed. 



25 



30 



Addition 
to Assay 



None (control) 
5mM iodoacetamide 
5mM N-ethylmaleimide 



Tablg 2 

Mean 
Activity* 



3776 
3851 
269 



Percent 
Inhibition 



0 
93 



*"Mean Activity" is a measurement of the mean score of 
duplicates provided in cpm as observed in the ether layer 
of Example 1 

These preliminary inhibitor studies indicate that the 
Bay C12-preferring acyl-ACP thioesterase is insensitive to 



SmM iodoacetamide aixd almost completely inhibited by 5mM N- 
ethylmaleimide . These results suggest that C12"-pref erring 
acyl-ACP thioesterase is an "active thiol" type of esterase 
rather than an "active serine" typei 

5 

Example 4 - Bay C12-preferring Acyl-ACP Thioesterase 

Substrate Specificity as a Function of Chain 
Length 

In tests comparing activity of the amonium sulfate 
10 fraction preparations of Bay C12-preferring acyl-ACP 

thioesterase of Example 2 against various length medium- 
chain fatty acids in the assay of Example 1, the greatest 
activity has been manifest towards C12-ACP over C8, CIO, 
C12, C14 and CI 6 ACP substrates as shown in Table 3, 



15 



Table 3 



20 



25 



Acyl-ACP acyl 



8 
10 
11 
12 
14 
16 

*C8-ACP activity set to 1.0 



Relative 
Thioesterase Achivity* 



1.0 
2.7 
3.7 
24.0 
4.0 
4.7 



Example 5 - Bay C12-pref erring Thioesterase Substrate 
30 Specificity as a Function of ACP versus CoA 

Crude extracts of Bay cotyledons hydrolyze lauroyl 

• coenzyme A (CoA) as well as lauroyl-ACP. This is due to 
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the presence of separate enzymes acting respectively on 
these substrates, i,e. to C12"preferring acyl-ACP 
thioesterase acting on lauroyl-ACP and another enzyme 
acting on lauroyl CoA. The distinct nature of these 
5 enzymes is indicated by their separation at the ACP column 
stage in the purification scheme. Lauroyl-CoA hydrolysis 
activity is found chiefly in the material which fails to 
bind the. ACP column, and C12-preferring acyl-ACP 
thioesterase activity is found in the material which binds 
10 and which is subsequently eluted with a phosphate 

concentration gradient. Activities of the peak fraction of 
unbound and bound material serves to illustrate this 
separation, as shown in the following table. 

15 

Activity on Activity on 

Fraction C12-CoA Sn b.gt.rate* C12"ACg SuhStcatfi* 

Flow-through 10808 300 

(nonbinding) 

20 Gradient-eluted 27 . 2772 

*cpm of ether-extract able" product 

Therefore, the Bay C12-pref erring acyl-ACP 
thioesterase shows much more activity towards lauroyl-ACP 
than towards lauroyl-CoA. 

25 

Example 6 - Role of the Enzyme in Lavrate Production 

Further evidence that the C12-preferring acyl-ACP 
thioesterase is involved in the biosynthesis of laurate 
groups that predominate in the Bay seeds comes from a 
30 comparison of the extract able activity of the enzyme at two 
different stages of seed development. As shown in the 



following table, Table 5, very young seeds, which contain 
only long-chain fatty acids and insignificant amounts of 
laurate, yield much less C12-pref erring ACP thioesterase 
than older seeds that have accumulated significant amounts 
5 of laurate. Thus it appears that significant activity of 
this enzyme is only present when the seeds are accumulating 
laurate. Additionally, there appears to be much less 
difference in lauroyl-CoA hydrolysis activity, consistent • 
with their being different enzymes as discussed above in 
10 Example 5. 

Table 5 

C12 acyl-CoA C12 acyl-ACP 

Hydrolysis Thioesterase 
Tissue Source Activity Tn AR.qay.c!* Activity In Afisayg* 

15 Normal Seeds 31,268 4704 

2g/cotyledon 
pair) 

Young Seeds 29,995 376 

0.5g/cotyiedon 
20 pair) 

*cpm ether-extractable radioactivity 

25 Example 7 - in vitro Bay Fatty Acid Synthesis Assay 

An ammonium sulfate fraction of a Bay embryo extract 
will synthesize the same specific fatty acids as those 
found in the maturing seed if supplied with E. coli ACP, 
malonyl-CoA, and other typical cofactor and substrate 

30 requirements of documented in vitro fatty acid synthesizing 
systems (Jaworski, et al., Arch. Biochein. Biophys. (1974) 
I6"3:769-776) . The products of this in vitro activity 
include water-soluble octanoyl and decanoyl esters but 
almost undetectable water-soluble lauroyl ester, even 
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though laurate is the major free fatty acid product. These 
results are most simply explained in terms of the fatty 
acid synthesizing system producing acyl-ACP's of 
successively increased chain length, and the specific 
lauroyl-ACP thioesterase intercepting the acyl-ACP when the 
acyl moiety has been extended as far as twelve carbon 
atoms, by hydrolyzing apart the acyl and ACP moieties at 
that stage. 

Example 8 - C-IO Preferring Acyl- ACP Thioesterase Assay 

Following the same general procedures as outlined in 
Example 1, to assay for CIO thioesterase activity, the 
following mixture is incubated at 30^ for 10-60 rain: 50^11 
sample to be tested in the same or similar "extraction 
buffer" described in Example 9A, and approximately 250pmol 
of [^'^C] -radiolabeled acyl-ACP substrate, (usually 
decanoyl-ACP is labeled in the carboxylate group to 50- 
60|LlCi/piol) in a total volume of 50jil, for a final 
decanoyl-ACP concentration of O.S-S.O^lM, typically S.OjIM, 
The reaction is stopped by adding 0.5ml 10% (v/v) cold (4°) 
acetic acid and placing the reaction mixture on ice for a 
few minutes. The fatty acid product of the hydrolytic 
enzyme action is extracted away from the unhydrolyzed 
substrate by adding 2ml diethyl ether and mixing 
vigorously. The ether is transferred to 5ml scintillation 
fluid for scintillation counting. Additional ether 
extracts may be performed to recover remaining traces of 
product for more accurate quantitation of the activity if 
desired. 



Example 8A 

In alternative to Example 8, enzyme activity is 
assayed by adding 25^.1 of sample to a screw-cap glass vial. 
Next, concentrated radiolabelled substrate [^^C]-C10 :0-ACP, 
5 54 . 7^iCi/nmol is added to the vial so that the substrate 
concentration will be 0.5\m in the final lOOfil assay 
volume. Finally enough assay buffer (lOOmM glycine-HCl, pH 
9, 0.2% CHAPS, lOmM B-mercaptoethanol) is added to. the vial 
so that the total volume is lOOp.1. The mixture is allowed 

10 to react by incubating at 30*^C for 30 minutes. The 

reaction is stopped by adding 0.5ml of 10% (v/v) acetic 
acid and then 1ml diethyl ether (anhydrous) . The 
radiolabelled free fatty acid product is extracted by 
vigorously mixing the stopped reaction. The ether phase is 

15 then transferred to 5ml of scintillation fluid and 

radioactivity determined by liquid scintillation counting. 

Example, 9 - Cuphea CIO Preferring Acyl-ACP Thioesterase 
Purification and Identification 

20 Immature seeds of Cuphea hookeriana are harvested. 

The total fatty acid composition of a few of the harvested 
seeds is analyzed by standard techniques to make sure that 
they are at the correct stage of development. This is 
defined as the stage at which octaiioate and decanoate 

25 predominate in the fatty acyl composition. The harvested 

seeds are stored frozen at -70**. This comprises the source 
tissue for enzyme extraction. 
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A first method for the purification and 
indentification of a Cuphea ClO-preferring acyl-ACP 
thioesterase is provided. 

An acetone powder is prepared by grinding the seeds to 
a powder in a mortar and pestle under liquid nitrogen, and 
then grinding the powder in a mortar and pestle with cold 
acetone (at approximately -20®) . The powder is collected 
by filtration and rinsed with cold ether to remove 
remaining traces of acetone. It is then extracted with 
10ml of "extraction buffer" per gram of acetone powder 
weight {this and all subsequent steps at 0-4**) comprising 
50mM KH2PO4-KOH pH 7.5, lOmM 2-mercaptoethanol, The 
homogenate is centrifuged at ll,000xg for 15 min at 4°, and 
the supernatant fraction used for subsequent purification 
steps after filtration through two layers of Miracloth 
(Calbiochem. Inc.; LaJolla, CA) . 

The supernatant fraction is then subjected to ammonium 
sulfate fractionation. The 40-60% saturation ammonium 
sulfate pellet (prepared as described in Example 2) is 
redissolved in "buffer" comprising 50mM KH2PO4-KOH pH 6.9, 

10% (v/v) glycerol, and lOmM 2-mercaptoethanol, and 
dialyzed against this buffer to remove remaining ammonium 
sulfate. 

The resulting preparation is then subjected to 
hydroxyapatite column chromatography. The following method 
applies to ammonium sulfate fraction from lOOg fresh weight 
of starting seed tissue. The dialyzed ammonium sulfate 
fraction (35-40ml) is applied to a column of hydroxyapatite 
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(2.5cm X 14cm bed height of Bio-Gel HTP from Bio-Rad Inc.; 
Richmond, CA, catalog no. 130-0420), equilibrated in 50mM 
KH2PO4-KOH pH 6.9, 10% (v/v) glycerol, 4mM 2- 

mercaptoethanol . The column is then washed (flow rate | 
5 1.5ml/min throughout) with 280ml of the same buffer. i 
Elution is accomplished with a 580ml linear gradient from ^ 
these conditions to 350mM KH2PO4-KOH pH 6.9, 10% (v/v) 

glycerol, 4mM 2-mercaptoethanol, collecting fractions of 

12ml volume. The eluted fractions are assayed for ' 

r- 

10 hydrolase activity using decanoyl-ACP as substrate. L 

I 

Two peaks of activity are obtained, one passing | 
through the column without binding, and the other binding L 
and being subsequently eluted with the phosphate gradient. 
Both peaks from the hydroxyapatite column contain 
15 hydrolytic activity towards long-chain substrates (acyl 
group of 14 or more carbon atoms) . As far as the medium- 
chain substrates are concerned, the flow-through peak shows 
little preference / whereas the gradient peak shows 

r 

considerable preference for decanoyl-ACP (See, Example 
20 llA) . L 
At an early stage in the partial purification, when 
buffered with lOOmM HEPES, the decanoyl-ACP ClO-preferring 
acyl-ACP thioesterase shows considerable tolerance of assay 
pH, activity changing minimally between pH 6.5 and 8.5, 
25 with a maximum at pH 7.5. In contrast there is sensitivity i 
to ionic strength in the assay, e.g. using potassium \ 
phosphate as the assay buffer activity declines as the ' 
phosphate concentration is raised, although activity is 
still detectable in 350 mM phosphate. 
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The ClO-preferring acyl-ACP thioesterase activity and 
other proteins in the partially purified preparations are 
lowered in concentration by extensive contact with glass 
and plastic surfaces. This effect "is reduced by the 
5 inclusion of detergents such as Triton XlOO or CHAPS in the 
column and assay buffers. Some detergents are stimulatory 
in the assay. 

The ClO-preferring acyl-ACP thioesterase activity is 
rapidly lost during the ammonium sulfate precipitation step 
10 of purification unless 2-mercaptoethanol is present in the 
buffers as described above. In the buffers described the 
activity is very stable both at 0" and during repeated 
freezing to -20* or -70°. 
ExampIP QR 

15 As a more preferred alternative to Example 9A, seeds are 

extracted as follows. 

An extraction paste is made with 1375ml of extraction 
buffer (200mM Bis-Tris-HCl, pH 6.5, lOmM B- 
raercaptoethanol), lOOg polyvinylpolypyrroiidone, andl3.75g 
20 soluble polyvinylpyrrolidone (10,000 average molecular 

weight). lOOg of Cuphea seeds are added to the paste. All 
subsequent steps are performed at 4'»C. The seeds and paste 
are homogenized with a Polytron until the mixture is smooth 
and there are no whole seeds intact. The homogenate is 
25 centrifuged at 10,000xg for 20 minutes. The supernatant is 
decanted and filtered through Miracloth. 

The filtered supernatant is mixed into a slurry with 
lOOml of the settled Blue-4 agarose resin that has been 
equilibrated with the extraction buffer. The slurry is 



washed on a Buchner funnel with 500ml of extraction buffer, 
then poured into a glass column and rinsed with more 
extraction buffer until the resin is packed. The column is 
first washed with lOOmM NaCl, 200mM' Bis-Tris-HCl, pH 6.5, 
5 lOmM B-mercaptoethanol. 400inM NaCl, 200mM Bis-Tris-HCl, pH 
6.5, lOmM B-mercaptoethanol is applied to the column and 
the eluate collected in fractions. Those fractions having 
enzyme activity are pooled and dialyzed against "S buffer" 
{50mM Bis-Tris-HCl, pH 6.0, 0.2% (w/v) CHAPS, lOmM fi- 

10 mercaptoethanol) . 

Next the sample is chromatographed on an S-Sepharose 
column as follows. The dialyzed sample from the Blue-4 
column is loaded on a 50ml column of S-Sepharose resin that 
has been equilibrated with S buffer. After washing the 

15 column with more S-buffer, the column is rinsed with 200mM 
NaCl, 50mM Bis-Tris-HClr pH 6.0, 0.2% (w/v) CHAPS, lOmM B- 
mercaptoethanol . Those fractions having enzyme activity 
are pooled and dialyzed a second time against S buffer. 

Next the sample is chromatographed on a Pharmacia FPLC 

20 (Piscataway, NJ) Mono-S column as follows. The dialyzed 

sample from the S-sepharose column is loaded on a 1ml Mono-- 
S column that has been equilibrated with S buffer. The 
column is washed with S-buffer until the 280nM absorbance 
has leveled. A 45ml gradient is applied to the column 

25 using S~buffer and S-buffer containing NaCl. The activity 
elutes between 75mM and 150mM NaCl. Those fractions with 
enzyme activity are pooled and dialyzed a third time 
against S buffer. 



Finally the sample is chromatographed on an ACP column 
as follows. A column containing 15ml of acyl-carrier 
protein coupled to Sepharose is equilibrated with S-buffer. 
The dialyzed sample from the Mono-3 column is loaded onto 
5 the ACP column at 0,2ml/min. The column is washed with S- 
buffer until the 280nm absorbance has leveled into a 
baseline. A 130ml gradient is applied to the column using 
S-buffer and S-buffer containing NaCl, The activity elutes 
between 50mM and 80mM NaCl. Those fractions having enzyme 
10 activity are pooled. 
Example 9C 

As a more preferred alternative to Example 9A or 9B, 
forty grams of polyvinylpolypyrrolidone (PVPP) are mixed 
with 550 ml "extraction buffer" comprising 200 mM bis-Tris- 
15 HCl pH 6., lOmM 2-mercaptoethanol, 1% (w/v) 

polyvinylpyrrolidone-10. To this mixture are added 40g 
frozen Cuphea seeds. The mixture is then blended in a 
Polytron homogenizer until no intact- seeds remain and the 
slurry is smooth. This and all subsequent steps are 
20 conducted at 0-4®C. The preparation is centrifuged at 
12,000xg for 20 min and further clarified by filtration 
through Miracloth. 

This preparation is mixed with 100ml of settled 
hydroxyapatite which has been equilibrated with "buffer A" 
25 (50mM bis-Tris-HCl pH 6./ lOpi 2-mercaptoethanol) . Three 
extract volumes of lOmM 2-mercaptoethanol are then added 
slowly over 30 min, with constant stirring. The 
hydroxyapatite gel is collected on a sintered glass funnel 
and rinsed with buffer A until the effluent is colorless. 



The collected hydroxy apatite is then transferred to a 
column and further rinsed with buffer A at 2inl/inin until 
the column is packed. A 400ml elution gradient is applied 
(2ml/min) r from buffer A to buffer B (300mM potassium 
5 phosphate pH 6,9 in buffer A). Effluent fractions are 

assayed for hydrolysis of 10:0-ACP. Two overlapping peaks 
of activity are obtained. The fractions comprising the 
later-eluting peak are pooled and dialyzed against buffer 
A. 

10 The dialyzed material is applied at 1.3ml/min to a 2.5 

X 6.5cm column of Blue 4 Agarose (Sigma Chemical Co.; St. 
Louis r MO) equilibrated with buffer A. The column is 
washed with buffer A, and enzyme activity is subsequently 
eluted with a 400ml gradient from buffer A to buffer C 

15 (buffer A containing IM NaCl) , The eluted fractions 

contain three peaks of 10:0-ACP hydrolysis activity. Those 
fractions comprising the second peak to elute (eluted by 
approximately 0.4m NaCl) are pooled and dialyzed against 
buffer A. 

20 The dialyzed material is applied at 0*5ml/min to a 1.7 

X 6cm column of immobilized 10:0-ACP analog equilibrated 
with buffer A. (This column is prepared by reacting 
heptylamine with iodoacetic anhydride in diethyl ether, and 
adding the product to purified, reduced E. coll ACP. The 

25 residual reagents are removed by gel filtration 

chromatography and the resulting 10:0-ACP analog is coupled 
to Pharmacia CNBr-activated Sepharose per the 
manufacturer's directions/ blocking unreacted groups with 
Tris.) The column is rinsed with buffer A and then 
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activity is eluted using a 200ml gradient from buffer A to 
buffer D (buffer A containing 0.5M NaCl) . Fractions 
corresponding to the eluted peak of 10:0-ACP activity are 
pooled and dialyzed against 50mM bis-Tris pH^ 6, lOmM 2- 
5 mercaptoethanol, 0.2% (w/v) CHAPS, 5mM sodium ascorbate. 
P3^$impj.g gp 

The protocol described in Example 90 may be further 
modified as follows. The fractions corresponding to the 
eluted peak of 10:0-ACP activity are pooled and dialyzed 

10 against buffer E {50mM Bis-Tris-HCl pH 6.0, 0.2% CHAPS, 
lOmM JJ-mercaptoethanol) . An FPLC Mono-S column (MonoS® 
HR5/5, Pharmacia LKB Biotechnology, NJ) is equilibrated 
with the buffer E. The dialyzed pool is loaded onto the 
column. All the CIO and C18:1 activity appears to bind to 

15 the column. The activities may be eluted with a linear 
140ml gradient from (buffer) to (butter+lM NaCl) . 

C18:l activity elutes between 75mM and lOOmM NaCl. 
There is a second peak of activity that elutes between 
ISOmM and 175mM NaCl. The second activity peak is 

20 primarily CIO and C18:l activity, with relatively little 
C12, C14, or C16 activity. Any C18:l activity in the 
second peak could be due to contamination by residual C18:l 
activity. 

25 Example 10 - CIO Acyl-ACP Thioesterase Inhibitor Studies 
Preliminary inhibitor studies with material from 
Example 9A indicate that the Cvphea ClO-preferring acyl-ACP 
thioestera.se is insensitive to phenylmethylsulfonyl 
fluoride, insensitive to iodoacetamide , and completely 



inhibited by SmM N-ethylmaleimide . This suggests that it 
is an "active thiol" type of esterase rather than an - 
"active serine" type. 

5 Example 11 - Cuphea CIO Acyl ACP Thloesterase Substrate 
Specificity as a Function of Chain Length 

Example llh 

The substrate specificity of Cuphea CIO acyl-ACP 
thioesterase towards medium-chain acyl-ACP's has been 
10 determined at the hydroxyapatite stage in purification, as 
described in Example 9A: 

Table 6 

Substrate Hydrolysis Activity 

(mean) (pmol/min).. 

15 C6-ACP 188 

C8-ACP 485 

ClO-ACP 6950 

Cll-ACP 649 

C12-ACP 1032 

20 C14-ACP 4055 

The activity towards the longer-chain substrate 14:0-ACP is 
considered to be due to the presence of long-chain 
thioesterase activity, analogous to long-chain 
thioesterases of safflower seed and avocado mesocarp tissue 

25 that are described in published literature. Assay of the 
preparation with the preferred substrate of such an enzyme, 
18:1-ACP, indicates the presence of substantial activity, 
consistent with this hypothesis. The activity towards 
10:0-ACP and the smaller amount of activity towards 8:0-ACP 

30 indicate the presence of the medium-chain- specific 
thioesterase responsible for medium-chain fatty acid 
production in developing Cuphea hookeriana seeds. 
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The reactions catalyzed have been shown to be simple 
hydrolysis. The ether-extracted products of both "time 
zero" reactions and one hour reactions with 6:0-ACP, 8:0- 
ACP, and 10:0-ACP substrates were chromatographed on silica 
G thin-layer plates (mobile phase: hexane/diethyl 
ether/acetic acid, 80:20:1 v/v) to determine the lipid 
class. Laurie acid was added as unlabeled carrier to 
inhibit evaporation of liberated short-chain free fatty 
acids. Tricaprin, dicaprin, monocaprin; and lauric acid 
were used as standards. The TLC plate was developed half- 
way and then air dried for 5 minutes. The plate was then 
returned to the tank and development was completed to 
within 1cm of the top of the plate. The developed plate 
was dried and then scanned for 800 mins on an AMBIS (AMBIS 
Systems, Inc. San Diego, CA) radiochromatogram scanner to 
quantitate radiolabeled spots. Following scanning, the 
plate was stained in iodine vapor for 15 minutes to 
visualize the lipids. The principal radiolabeled products 
co-migrated with the free fatty acids, and were 
substantially more radioactive in the samples incubated for 
1 hour compared with the zero-time controls . 

To verify that the chain lengths of the products were 
those of the corresponding substrates, the ether-extracted 
products (with an unlabeled free fatty acid mixture as 
carrier) were neutralized to phenolphthalein endpoint with 
KOH and then derivatized with bromphenacyl bromide and 
chromatographed by reverse-phase HPLC. A CIS column was 
used in conjunction with an acetonitrile/water gradient. 
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In all cases only one chain length of product was 
observed, identical to the substrate chain length. 
Example IIB 

The preparation from Example 9'C is relatively 
5 selective in its hydrolysis of acyl-ACP thioesters, as 
shown in the following table: (These activities were 
determined as follows. Twenty five )il of sample were added 
to 75|il assay buffer comprising lOOmM glycine-KOH pH 9, 
0.2% (w/v) CHAPS, lOmM 2-mercaptoethanolr and containing 

10 radiolabeled acyl-ACP for a final concentration of 0.5|IM. 
After incubation at 30°C for 60 min, the reaction was 
terminated by addition of 0.5ml 10% (v/v) acetic acid, and 
the liberated fatty acid product was extracted with 1ml 
diethyl ether. Enzyme activity was measured by the 

15 radioactivity of this ether extract, determined by liquid 
scintillation counting. A correction was applied for the 
small amount of non-enzymatic hydrolysis that took place.) 

Sybgty^tg Activity (gpm) 
20 10:0-ACP 1010 

12:0-ACP 393 
14:0-'ACP 30 
16:0"ACP 262 
18:1-ACP 696 



25 



The removal of long-chain thioest erase is incomplete, 
as evidenced by the partial overlap of all peaks from the 
Blue 4 Agarose column, and the data shown in the above 
table. 
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T^yam ple 12 - C-I8 Preferring Acyi-ACP Thloesterase Assay 

To assay for long chain thioesterase activity lOp.1 of 
the enzyme source to be analyzed is' incubated at room i 
5 temperature for 10 minutes in a solution comprising lOOmM j 
Tricine-NaOH, pH 8.5, and 3|1M ^^C stearoyl-ACP or 3^M ^^C ^ 
oleoyl-ACP, in a total volume of 50|Xl. Acyl-ACP substrates 
are prepared as described in Example 1 for preparation of 
lauroyl-ACP and radiolabeled in the carboxylate group at a 

10 specific radioactivity of 50-60|XCi/M^ol , ^ 
The reaction is stopped by the addition of 50^11 H2O ^ 
and lOOtll isopropanol which contains ImM each of stearic ^ 
acid and oleic acid. The fatty acid product of the 
hydrolytic enzyme action is extracted away from the 

15 unhydrolyzed substrate by adding 1ml petroleum ether that 
is saturated with 50% isopropanol in H2O. After settling 
for a few minutes, an aliquot of the petroleum ether layer 
is removed for determination of radioactivity by liquid |_ 
scintillation spectrometry. ; 

20 

pvamplfi 13 - Safflower C-18 Preferring Acyl-ACP 

Thioesterase Purification and Identification 

An initial purification of thioesterase protein from 
developing safflower seeds which initially follows the 
25 method of McKeon and Stumpf (J. Biol. Chem. (1982) 

257:12141-12147), is described. Developing safflower seeds ^ 
from greenhouse grown plants are harvested between 16 and 1 
18 days after anthesis, frozen in liquid nitrogen and 
stored at -70°C. 
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Approximately 50g of frozen seeds are ground in liquid 
nitrogen and sieved to remove large seed coat pieces to 
yield a fine powder. The powder is washed with acetone on 
a Buchner funnel until all yellow color is absent from the 
filtrate. The powder is then air dried and further 
processed as described below, or may be stored frozen at - 
70^0. 

The dried acetone powder is weighed and triturated 
with fifteen times its weight of 20mM potassium phosphate, 
pH 6.8. The mixture is then centrifuged at 12,000 x g for 
20 minutes and decanted through a layer of Miracloth. 

The acetone powder extract is acidified with glacial 
acetic acid to pH 5.2, held on ice for 30 minutes, and then 
centrifuged at 12,000 x g for 10 minutes. The supernatant 
is adjusted to pH 4.4 with glacial acetic acid, held on ice 
for 30 minutes, and then centrifuged as above. The 
precipitate is resuspended 0.02M potassium phosphate (pH 
6.8), the pH is adjusted to 6,8, and the suspension is 
clarified by centrifugation at 12,000 x g for 10 minutes, 

A column of ACP-Sepharose 4B for affinity 
chromatography is prepared as described in Example 2. ACP 
is isolated from E. coli strain K-12 as described in 
Example 1. The clarified supernatant from the acid 
precipitation is dissolved in ACP column buffer (20mM 
potassium phosphate, 25% glycerol (w/v), 0.1% CHAPS (w/v) , 
pH 6.8) and applied to a 2.5cm x 3.7cm ACP-Sepharose 4B 
column at an application rate of 50ml per hour. The 
activity is eluted with a 5 bed volume gradient of 20-400mM 
potassium phosphate in ACP column buffer. The activity 
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eluted 1„ a single paalc at 180-320« potassium phosphate, 
and recovery of the thloesteraae was 100%. 

The active fractions from the above SCP-Sepharose 
column were pooled, diluted to 20nM potassium phosphate, 
5 0.1% CHAPS, 23% glycerol, and applied to a different ACP- 
Sepharose column and chromatographed as described above. 

The resulting material was then applied to a 
ohromatofocuslng column for further purification of 
safflower thloesterase activity. The buffer of the "flow- 
10 through, from the second ACP column was changed to 20mM 
bis-Trls-HCl, 25% glycerol ,„/v) , 0.1% CHAPS <w/v) , pH 
7.4, ("Start" buffer) by concentration and dilution in an 
*»iccn (Danvers, m> stlrred-cell ultrafiltration apparatus 
using a PM-10 membrane. A ohromatofocuslng column ,1cm x 
15 7.3cm) is packed with Pharmacia PBE 94 which has been 

equilibrated in "Start" buffer. The senile is applied to 

the PBE 94 column at a rate of .. 

rate of 35ml per hour and the column 

is washed with 3 bed volumes of start buffer. The pH 
gradient is formed and the protein is eluted at 60ml per 
20 hour by the application of 82ml of elution buffer which 

contains, per 100ml, loml Pharmacia PB 74, 25g glycerol, ig 
CHAPS, and enough HCl to reach pH 4.0. An additional two 
bed volumes of the elution buffer Is applied after the pH 
Of the column has reached 4.0. The safflower thloesterase 
25 activity elutes In two pea.cs, one at about pH 5.2, and the 
second peak spanning from PH 4.5 to 4.0. Fractions 
representing these activity peaks are analyzed by SDS-PAGE 
(Laemmll, supra) and silver staining. 
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In both peaks, two major bands were observed which 
correlate with thioesterase activity. These bands 

represent proteins having relative molecular weights of 34 ^ 
and 40 kD as estimated by comparison to protein standards. | 
5 The fractions of the two activity peaks from the 

chromatofocusing are pooled separately, and concentrated as 
described above. The 34 and 40 kD thioesterase proteins 
are further isolated for amino acid sequencing by transfer 
of these proteins to either nitrocellulose or PVDF (either 

10 Immobilon-P (Millipore; Bedford, MA) or ProBlott (Applied I 
Biosystems; Foster City, CA) ) membranes following SDS-PAGE. ^ 
Nitrocellulose is preferred when proteins will be L 
subsequently enzymatically digested, while ProBlott is ^ 
preferred for N-terminal sequencing methods and Immobilon-P 

15 for samples to undergo cyanogen bromide digestion. 

Example 14 - Plant Thioesterase Sequencing 

In this example, amino acid.suid nucleic acid 
sequencing of two exemplified plant acyl-ACP thioesterases ^: 
20 is described. This technique may also be employed for the 1 
sequencing of other plant thioesterases of this invention 
as well. 

All sequencing is performed by Edman degradation on an 
Applied Biosystems 477A Pulsed-Liquid Phase Protein 
25 Sequencer; phenylthiohydantoin (PTH) amino acids produced j 
by the sequencer are analyzed by an on-line Applied 
Biosystems 120A PTH Analyzer. Data are collected and 
stored using an Applied BioSystems model 610A data analysis 
system for the Apple Macintosh and also on to a Digital 
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Microvax using ACCESS*CHROM software from PE NELSON, Inc. 
(Cupertino, CA) . Sequence data is read from a chart 

recorder, which receives input from the PTH Analyzer, and j 
is confirmed using quantitative data obtained from the ^ 
5 model 610A software. All sequence data is read I 
independently by two operators with the aid of the data ~ 
analysis system. 

For peptide samples obtained as peaks off of an HPLC, 
the sample is loaded onto a Polybrene coated glass fiber 

10 filter (Applied Biosystems, Foster City, CA) which has been h 
subjected to 3 pre-cycles in the sequencer. For peptides ^ 
which have been reduced and alkylated, a portion of the L 
PTH-amino acid product material from each sequencer cycle - 
is counted in a liquid scintillation counter . For protein 

15 samples which have been electroblotted to Immobilon-P, the 
band of interest is cut out and then placed above a 
Polybrene coated glass fiber filter, pre-cycled as above 
and the reaction cartridge is assembled according to ^ 

3 

manufacturer's specifications. For protein samples which ^ 
20 have been electroblotted to ProBlott, the glass fiber 1 
filter is not required. 

In order to obtain protein sequences from small 
amounts of sample (5-30 pmoles) , the 477A conversion cycle 
and the 120A analyzer as described by Tempst and Riviere 
25 (Anal. Biochem. (1989) 183:290). I 
A. Sequencing of Proteolytic Fragments i 
A sample of Bay thioesterase purified through the ACP-- 
Sepharose step of Example 2 is prepared for proteolytic 
digestion and sequencing. The sample. {12flg of thioesterase 



in 80^11) is denatured and reduced by heating to 95°C for 5 
minutes in 160M.1 of Andersons* sample buffer (Anderson & 
Anderson, Anal. Biochem. (1978) 85:331-340) containing 2% 
sodium dodecyl sulfate, 5% Ji-mercaptoethanol, 20% glycerol, 
5 2% 3/10 ampholytes, and 2% Triton X-100. Proteins in 20^.1 
aliquots {l\lg total protein in each) are separated by two- 
dimensional electrophoresis as described by Anderson and 
Anderson {Anal. Biochem. (1978) 85:331-340 and 341-354), 
except that the second dimension slab gel is 1,5mm in 
10 thickness. After the second dimension electrophoresis, ^ 

L 

each of the slab gels is removed and proteins within the t 
gel are blotted directly to a nitrocellulose membrane in a L 
transblot system (Bio-Rad, Richmond, CA) using the method I 
of Towbin et al {Proc. Nat, Acad. Sci. USA (1979) 75:4350- 
15 4354) . The protein spots on the membrane are detected by 
reversible staining with Ponceau S (Sigma, St. Louis, MO) 
as described by Salinovich and Montelaro {Anal. Biochem. 
(1986) 155:341-347) . Alternatively the spots may be . . 

*- 

detected by staining with amidoblack (Schaffner and I 
20 Weissman, Anal. Biochem. (1973) .55:502-514) . ]_ 
For preparations of Bay thioesterase or of 
thioesterases having undergone an additional 
chromatographic purification step, one -dimensional 
polyacrylamide gel electrophoresis is sufficient to produce 
25 protein pure enough for sequencing. In this case, the 
sample (12[lg of thioesterase in &0p.l) is reduced and 
denatured by heating to 95*^C for 5 min with 20^11 of a 
sample buffer containing 25% (v/v) glycerol, 2.5% (w/v) 
sodium dodecyl sulfate (SDS) , and 25% (v/v) B- 
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.- mercaptoethanol in 0.156M Tris-HCl, pH 6.8. Proteins in 
separate aliquots (30"-35|Lll) of the sample are separated by 
one-dimensional electrophoresis as described by Laemmli 
(Nature (1970) 227:580-685), one aliquot per 1-cm lane on a 
5 1.5mm thick gel. After completion of the electrophoresis, 
the gel is removed, blotted, and thereafter the samples are 
treated as described for the two-dimensional case. 

In preparation for digestion, spots corresponding to 
thioesterase protein are cut out of each of the membrane 
10 blots and are pooled together in a plastic test tube. The 
methods of treatment and digestion have been described by 
Aebersold et ai {Proc. Nat'l Acad Sci. U.S.A. 84:6910- 
6974)). The membrane pieces are treated for 30 min at 37^ 
C with 1.0-1. 2ml of freshly prepared 0.5% (w/v) 
15 polyvinylpyrrolidone with average molecular weight of 

40,000 (PVP-40, Aldrich, Milwaukee, WI) dissolved in lOOmM 
acetic acid- The excess PVP-40 is removed by several 
•washes with 3-4ml of water (HPLC grade), removal of PVP-40 
is complete when the absorbance at 214nm of successive 
20 washes no longer decreases or reaches that of a water 
blank. The pieces are then removed from the wash tube, 
minced and are placed in a 1-ml Eppendorf plastic tube, and 
lOOmM Tris-HCl or lOOmM sodium carbonate, pH 
8.2/acetonitrile, 95:5 (v/v) is added so that the liquid 
25 just covers the top of them. The digest is started by 
addition of 10)11 of Boehringer Mannheim sequence grade 
trypsin {100|LLg/ml solution in 1% HCl) , and the sample is 
allowed to digest at 37^C for 8-24 hr., with occasional 
stirring. The amount of protease added is usually between 
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1/20 and 1/10 of the weight of protein being digested. 
Peptides elute from the membrane into the digest buffer 
during the incubation. The digestion is terminated by 
addition of lOp.1 of 10% (v/v) trif luoroacetic acid (TFA) . 
Alternatively the chips may be suspended in lOOmM sodium 
phosphate or 25mM ammonium carbonate^ pH 7.8/acetonitrile, 
95:5 (vo/v) r and digested for 8-24 hours at 25''C with lOjil 
of Boehringer Mannheim sequence grade endoproteinase gluC 
(lOO^ig/ml solution in water) . 

Digestion with trypsin allows cleavage at lysine and 
arginine residues^ whereas digestion with gluC cleaves at 
glutamic acid residues (and also aspartic acid under some 
conditions) depending upon the buffer. Digestion of 
separate samples with each of the proteases affords 
identification of overlapping peptides and construction of 
longer peptide sequences useful for PGR technology. 

The digest mixture is removed from the nitrocellulose 
pieces, the nitrocellulose pieces are washed with 1-5 lOOjll 
volumes of 0.05% (v/v) TFA, and these volumes are 
concentrated to a volume of less than lOOjll in a Speed-Vac 
(Savant; Farmingdale, NY) . These concentrates are then 
injected over a Vydac reverse phase Protein & Peptide CIS 
column (2,1mm x 100mm) installed in an Applied Biosystems 
(Foster City, CA) Model 130 High Performance Liquid 
Chromatograph (HPLC) . Mobile phases used to elute peptides 
were: Buffer A: O.lmM sodium phosphate, pH2,2; Buffer B: 
70% acetonitrile in O.lmM sodium phosphate, pH2.2. A 3- 
step gradient of 10-55% buffer B over two hours, 55-75% 
buffer B over 5 minutes, and 75% buffer B isocratic for 15 
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minutes at a flow rate of 50p.l/minute is used. Peptides 
are detected at 214nm^ collected by hand, and then stored 
at c. 

Separation of the released peptides may also be 
accomplished through reverse-phase HPLC on a C18 (2 x 
ISOmm) column using a r20-min gradient increasing from 7% 
to 70% acetonitrile in 0.1% TFA at a flow rate of 50p,l per 
min. The elution of peptides is monitored by absorbance at 
214nm^ each peptide being collected into a separate 
fraction tube. The peptides are stored frozen at -20®C 
until application to the protein sequencer (Applied 
Biosystems, Foster City, CA) . 

Alternatively, the peptides may be alkylated before 
separation on HPLC. Alkylation allows identification of 
cystine residues on the sequencer, which otherwise go 
undetected. The unacidified digest mixture is reduced by 
addition of 1|J,1 of 10% (v/v) JJ-mercaptoethanol (l,43|lmol) 
and incubated at 37*'C for 2 hours. The reduced peptides 
are then alkylated with approximately 1.6|imol of [%]- 
iodoacetic acid (200mCi/mmol) for 30 min at room 
temperature in the dark. Depending upon the concentration 
of ii-mercaptoethanol the [^H] -iodoacetic acid may be 
adjusted to a ratio of 1:1.1. The mixture is then 
acidified with 10|LLl of 10% (v/v) TFA, applied to the 
reverse-phase HPLC column and further treated as described 
above. Other alkylating agents may be used including 
iodoacetamide and 4-vinylpyridine. The latter reagent 
leads to formation of pyridylethyl-cysteine residues which 
are identifiable on the protein sequencer by the unique 



retention time of its corresponding PTH-artiino acid 
derivative. 

The Bay thioest erases of the 34 kD doublet are 
sequenced independently (A and B) . ' Peptide sequences are 
shown in Table 8. It is noted that several of the band A 
and B peptides were either identical or near identical in 
sequence . 

B MP " A " 

YPTWPNFVL-T (M) L (I) (G) (A) (Q) 
DLMWW 

-GYNP- (D) IPFVG 
I 

LND— (HPLC crashed after #3) 

(T) -TLVDW{P) FVIWFVFIDNVAVK 

LNDLTADYIQS-LTP (R) 
S G 

AG (G) WVFETVPDXIFE 

NETGVIFWMW (A) FGP (I) 
K I 

SVGILGDGFGTTLEMSK 
G 

GISVIPAEP (R) 



SQ 736 
SQ 737 
SQ 739 

SQ 740 
SQ 741 
SQ 742 

SQ 743 
SQ 745 

SQ 747 

SQ 749 



SEQ ID 
SEQ ID 
SEQ ID 

SEQ ID 
SEQ ID 

SEQ ID 
SEQ ID 

SEQ ID 

SEQ ID 



NO: 1 

NO: 2 

NO: 3 

NO: 4 

NO: 5 

NO: 6 

NO: 7 

NO: 8 

NO: 9 
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'BAND B" 



SQ 


696 


SEQ 


ID 


NO: 


10 


LNDSTADYIQGGLTP 
L 


SQ 


697 


SEQ 


ID 


NO: 


11 


SVGILGDGFGTTLXMSK 


SQ 


698 


SEQ 


ID 


NO: 


12 


GISVIPAEPR 


SQ 


699 


SEQ 


ID 


NO,: 


13 


YVA (E) VFETVPDXIF 


SQ 


701 


SEQ 


ID 


NO: 


14 


STD ILAVMNXMQFATLNXAK 


SQ 


702 


SEQ 


ID 


NO: 


15 


— IGPAF (I) DNVAVK 


SQ 


703 


SEQ 


ID 


NO: 


16 


— IGPAFIDNVAVK 


SQ 


704 


SEQ 


ID 


NO: 


17 


(S) TSLSVLMNT 


SQ 


765 


SEQ 


ID 


NO: 


18 


DSIFES 


SQ 


766 


SEQ 


ID 


NO: 


19 


DYIQGGLTP-W 


SQ 


767 


SEQ 


ID 


NO: 


20 


DSVL-SLTTV-GGSSEA 


SQ 


768 


SEQ ID NO 


: 21 


DTVEVE-IIANs 
S 


SQ 


769 


SEQ 


ID 


NO: 


22 


D-FrGISVIPAEPr 


SQ 


770 


SEQ 


ID 


NO: 


23 


DSFrGISIVAEPr 


SQ 


772 


SEQ 


ID 


NO: 


24 


DWVIEYrPGV 


SQ 


773 


SEQ 


ID 


NO: 


25 


DHLLeLEGGsEVL-a 



N-terminal proteins can also be sequenced without 
digestion. For example, proteins are electroblotted to 
Inmiobilon-P PVDF for 30 minutes in the following buffer: 
12.5inM Tris/5mM glycine in 10% (v/v) methanol. Following 
electroblotting, membranes are stained in 0.1% (w/v) 
Coomassie Blue in 50% (v/v) methanol/10% (v/v) acetic acid 
for 5 minutes and destained in 2-3 changes of 50% (v/v) 
methanol/10% (v/v) acetic acid, 2 minutes for each change. 
Following this, PVDF membranes are allowed to air dry for 
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30 minutes and are then stored dry in heat-sealed plastic 
bags at -20°C, Protein blotted to PVDF is used directly to 
determine N-terminal sequence of the intact protein. 

In this manner, the N-terminal' amino acid of the bay 
5 34 kD thioesterase of band B is determined as: 
SQ 837 SEQ ID NO: 39 LEWKPKPK(L)PE (L) LD 

Further, the sequence of a bay thioesterase which 
migrates slightly faster than the Band B peptide is 
determined as: 

10 SQ 840 SEQ ID NO: 40 LLDDHFGLHGLVFRRTFAIRSYEVGPDR 

t 

B. Cyanogen BrQitiidft Cleavage of Protein and ^ 
Separation of Peptides L 

As an alternative method cyanogen bromide cleavage may 

15 be performed. For example, as exemplified with the 34 and 

40 kD saf flower thioesterase proteins using the methodology 

described in the Probe-Design Peptide Separation System 

Technical Manual from Promega, Inc. (Madison, WI) . The 

peptides shown below in Table 9 were obtained in this ^ 

20 manner. ! 

The proteins are electrophoresed on SDS-polyacrylamide L 

gels (Laemmli, supra) and blotted to Immobilon-P PVDF 

membrane as described above. Protein bands are cut out of 

the blot and each band is placed in a 1.5ml microcentrifuge 

25 tube containing 200^1 of a lOmg/ml solution of cyanogen 

bromide in 70% (v/v) formic acid. Protein bands are 

incubated in this solution overnight at room temperature, i 
and following this incubation the cyanogen bromide ' 
solutions are removed and pooled. The pooled solution is 
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dried under a continuous nitrogen stream using a Reacti-Vap 
Evaporator (Pierce, Rockford, IL) . Cyanogen bromide 
peptides are eluted off the Immobilon~P PVDF membrane using 
a peptide elution solvent with the following composition: 
70% (v/v) isopropanol, 0.2% (v/v) trifuoroacetic acid, 
O.lmM lysine, and O.lmM thioglycolic acid. 200^.1 of this 
elution solvent is added to each tube and tubes are 
incubated for 2 hours at room temperature with occasional 
vortexing. The elution solvents are then removed from each 
tube, pooled, added to the tube containing the dried 
cyanogen bromide solution, and dried as described above. 
The elution procedure is repeated with fresh elution 
solvent for an additional 2 hours and the pooled solvent is 
added to the previously dried material and again dried. 
50|Lil of HPLC grade water is then added to the dried 
peptides and the water removed by evaporation in a Speed- 
Vac (Savant, Inc., Farmingdale, NY). 

Peptides are separated using a Tris/Tricine SDS-PAGE 
system similar to that described by Schagger and von Jagow 
{Anal. Biochem. (1987) 155:368-379). Either 16% or 10-20% 
(gradient) acrylamide tricine-SDS-PAGE pre-cast gels (Novex 
Inc., Encinitas, CA) , are used for the separation. Gels 
are run in a Tall Mighty Small electrophoresis apparatus 
from Hoefer Scientific Instruments (San Francisco, CA) . 
Prior to electrophoresis of the peptides, gels are pre-run 
with thioglycolic acid added to the cathode buffer at a 
concentration of 0.1-0.2mM for 30-60 minutes at a constant 
voltage of 30 volts. Running buffer used is made up from a 
lOX stock, also from Novex; final concentration (IX) is 
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O.IM Tris, O.IM Tricine and 0.1% (w/v) SDS. The dried 
peptides are resuspended in 15^1 HPLC grade water and 15|il 
2X sample buffer consisting of: 0.125M Tris-HCl, 2% (w/v) 
SDS, 5% (v/v) B-mercaptoethanol, 20% (v/v) glycerol, and 
5 0.0025% (w/v) bromphenol blue, and boiled for 5 minutes 

prior to loading on the gel. ; 

Gels are run at a constant voltage of 125-150 volts I 
for approximately 1 hour or until the tracking dye has 
begun to run off the bottom edge of the gel. . Gels are 
10 soaked in transfer buffer (125mM Tris, 50mM glycine, 10% t 
(v/v) methanol) for 15-30 minutes prior to transfer. Gels £ 
are blotted to ProBlott sequencing membranes for 2 hours at f 
a constant voltage of 50 volts. The membranes are stained I 
with Coomassie blue (0.1% in 50% (v/v) methanol/10% (v/v) 
15 acetic acid) and destained for 3X 2 min. in 50% (v/v) 

methanol/10% (v/v) acetic acid. Membranes are air-dried 
for 30-45 minutes before storing dry at -20*'C. 

Peptides blotted on to ProBlott can be directly loaded 
to the sequencer cartridge of the protein sequencer without f 
the addition of a Polybrene-coated glass fibre filter. i 
Peptides are sequenced using a slightly modified reaction ' 
cycle, BLOT-1, supplied by Applied Biosystems. Also, 
solution S3 (butyl chloride), is replaced by a 50:50 mix of 
SI and S2 (n-heptane and ethyl acetate) . These two 
25 modifications are used whenever samples blotted to ProBlott 
are sequenced. 

i 

Amino acid sequences of cyanogen bromide fragments of . i 
the 34 and 40 kD proteins are determined by the N-terminal 
sequencing method described above. Sequences obtained in 



this manner are presented in Table 9, wherein the one- 
letter abbreviation for amino acids is used and X indicates 
an unidentified amino acid. 



5828 SEQ ID NO: 26 

5829 SEQ ID NO: 27 

5830 SEQ ID NO: 28 
8833 SEQ ID NO: 29 
S834 SEQ ID NO: 30 
.S834B SEQ ID NO: 31 



GSLTEDGLS YKEVF I IRXYEVGINKTA 

NKHVNNVTYIGXVLESIPQEVIDTHELQ 
* TITLDYRRE 

AVRTGEQPTGVAVGLKEA 

KDHASGQVIG 

NEDTRRLQKVNDD VEDE YLVF IP 
HIEIYXYPA . 



As the above protocol results in partial cyanogen bromide 
cleavage, peptides of varying relative molecular weights 
having common amino acid sequences are obtained. The amino 
acid sequence of one peptide (sequence not shown) was 
determined to correspond with amino acid sequence of a 
saf flower desaturase protein (Thompson et al., Proc. Nat. 
Acad. Sci. (1991) 59:2578-2582). 

C. Isolation and Assembly of cDNA 

Once partial amino acid sequences are determined, they 
may be used to obtain DNA sequence of the plant 
thioesterase via Polymerase Chain Reaction (PGR) 
technology. Thus, oligonucleotide fragments are 
synthesized on an Applied Biosys terns model 380A DNA 
synthesizer to amino acid sequences which have the least 
redundancy for use as PGR primers. Restriction sites are 
designed into the ends of the oligonucleotide primers so 
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that the resulting DNA fragments may be readily manipulated 
in cloning. Purified genomic DNA or RNA isolated from the 
plant thioesterase source are used as templates in 
reaction. 

5 PGR reactions are run using Taq polymerase (Gene Amp 

Kit) and the DNA thermal cycler (Perkin-Elmer/Cetus) in two 
different combinations of the oligonucleotides as 5'- or 
3'- primers. The resulting DNA products are run on an 
agarose gel for separation. DNA sequences are detemined 

10 by the dideoxy-chain termination method of Sanger et. ai, 
Proc. Natl. Acad. Sci. USA (1977) 74:5463-5467) using the 
7-Dea2a-dGTP Reagent Kit with Sequenase Version 2 Enzyme 
(United States Biochemical Corp., Cleveland, OH). The 
sequence data are analyzed using the IntelliGenetics Suite 

15 of molecular programs Gel and SEQ. 

i* R^A IsQlatioy^ 

Total RNA is isolated from developing Bay seeds 
according to the method of Turpen and Griffith 
(Biotechniques (1986) 4:11-15). Briefly, 50g of fresh 

20 frozen material is homogenized in 4M guanidine thiocyanate 
and 2% sarcosyl. The cleared lysate is layered upon a 5.7M 
CsCl cushion and centrifuged for 5.5 hours at 50,000 rpm. 
The RNA pellet is dissolved in water, extracted with phenol 
and chloroform, and precipitated with ethanol. The 

25 resulting pellet is resuspended in water and represents the 
total RNA fraction. Poly (A) RNA is isolated from this 
material according to Maniatis et ai. (Molecular Cloning: A 
Laboratory Manual (1982) Cold Springs Harbor, New York) . 
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2, PCR Genera tion of a Partial Thioesterase CPNA 
The protein sequence information from the peptides of 

Table 8 is used to design degenerate oligonucleotides (SEQ 
ID NO: 32-33). {See, Fig. 1). These oligonucleotides are 
5 used as primers in order to amplify thioesterase sequence 
from Bay embryo total cDNA (Lee et al. Science (1988) 
335:1288-1291) . Thus, poly (A) RNA from Bay embryos is 
reverse transcribed with M-MLV reverse transcriptase (BRL, 
Bethesda, MD) to obtain a single strand cDNA, This cDNA is 

10 used as a template for the thioesterase specific 

oligonucleotides in a PGR. The reaction is carried out 
according to manufacturer's instructions having the thermal 
cycler set for the following cycling program: 30 cycles; 1 
min. at 94^, 1 sec. at 65°, slope of 2 min from 65"^ down to 

15 50"*, and 2 min. at 74°. PGR reactions are analyzed by 
agarose gel electrophoresis. The DNA fragment 
corresponding to the resulting 800 bp band is cloned. DNA 
sequence analysis (SEQ ID NOS: 34-35 and Fig. 2) shows that 
indeed this DNA fragment codes for several of our 

20 thioesterase peptides. 

3. Isolation of Thioesterase cDNA Clones, 

The 800 bp PCR-generated DNA fragment is labeled with 
32p (Random Primed DNA labelling Kit, Boehringer Mannheim, 
Indianapolis, IN) and used as a probe to screen ^ 
25 approximately 2 million plaques of a conventionally created 
cDNA library: (double stranded, oligo dT primed cDNA is 
synthesized from the Bay seed poly (A) RNA according to 
Gubler and Hoffman, Gene (1983) 25:263-269; EcdRl linkers 
are ligated to the ends, and the resulting material cloned 



into a bacteriophage expression vector, La.*da2«P, 
Stratagene; la joHa, a. 

longest library cione overlaps ,or lu bp „itb our 
aoo, se^ence ^tch in this stretch, . it 

5 extends further to the 3' end of ti,« , 

end of the transcript, see Fig. 

By linking the 800 bp PCR frag^« with the longest 
bacteriophage clone at the shared «ndnx site ,.ee, .ig 
. lane ,343,,, a UOO bp long contiguous frag:»„t with 
potential reading fra^ of about 1000 coding basepairs is 
created. 

constructed fro. bay poly,.,. ^^^^ 

vector pCGm703. pCGN1703 is derived fr™ 
, . uerived from the commercial 

-".o, c.,, and made as follows, .he polyli„.er of 
Bluescribe m3- is altered by digestion with BamHI 
treatment with mung bean endonudease, and blunt-end 
ligation to create a Ba^i-deleted plasmid, pCGN1700. 
ao PCKnoo is digested with .coHI and .sti (adjacent 

^striction Sites, and annealed with a synthetic linger 
•saving restriction sites for BamHl, .stl, 
^«1. a 5. overhang of MTT, and a 3. overhang of rc^ 
The insertion of the linger into pc^noo eliminates the 

25 ScoRl site, recreates the SstI fai,„ 

tne sstI (also, sometimes referred to 

as "Sacl.. herein, site found in Bluescribe, and adds the 
new restriction sites contained on the linker. The 
resulting plasmid pCGN1702, is digested with «indIII and 
^lunt-ended with Klenow enzyme; the linear is partially 



digested with PvuII and ligated with T4 DNA ligase in 
dilute solution. A transformant having the lac promoter 
region deleted is selected (pCGN1703) and is used as the i 
plasinid cloning vector. ' i 

5 Briefly, the cDNA synthesis is as follows.. The 

plasmid cloning vector is digested with SstI and " 
homopolymer T-tails are generated on the resulting 3*- 
overhang stick-ends using terminal deoxynucleotidyl 
transferase. The tailed plasmid is separated from 

r 

10 undigested or un-tailed plasmid by oligo (dA) -cellulose ^ 

1- 

chromatography . The resultant vector serves as the primer ^ 
for synthesis of cDNA first strands . covalently attached to - 
either end of the vector plasmid. The cDNA-mRNA- vector 
complexes are treated with terminal transferase in the 

15 presence of deoxy guanos ine triphosphate, generating G-tails 
at the ends of the cDNA strands. The extra cDNA-mRNA 
complex^ adjacent to the BamHI site, is removed by BamHI 
digestion, leaving a cDNA-mRNA-vector complex with a BamEl . ^ 
stick-end at one end and a G-tail at the other. This 

20 complex is cyclized using an annealed synthetic cyclizing 1 
linker which has a 5 ' BamHI sticky-end, recognition . 
sequences for restriction enzymes Notl, fJcoRI and 5stl, and 
a 3' C-tail end. Following ligation and repair the 
circular complexes are transformed into E. coli strain DH5a 

25 (BRL, Gaithersburg, MD) to generate the cDNA library. The I 
bay embryo cDNA bank in plasmid vector pCGN1703 contains \ 
approximately 1.5x10^ clones with an average cDNA insert 
size of approximately 500 base pairs. 
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A full length cDNA of the bay thioest erase, 3 A- 17 
(pCGN3822), was isolated from the pCGN1703 library by 
screening with the ^^P-labeled 800 bp PCR-generated 

f 

fragment of thioesterase as described above. | 

f 

5 4. gPNA S?qugfice 

In siimmary, approximately 1200 bp of contiguous DNA 
sequence is shown in Fig. 2. This comprises about 80-90% of 
the coding region for the mature Bay thioesterase and a 200 
bp 3' untranslated sequence containing translational stop 
10 and poly (A) addition sequences. L 
The 580 bp of coding region now sequenced is estimated r 
to be about 50% of the total coding frame of the mature L 
protein. This partial sequence, when translated, codes for I 
a polypeptide which contains many sequences from Table 8 
15 (SEQ ID NOS: 1-25), some are shown aligned in Fig. 2. 
Peptides not coded for might be located in the not yet 
sequenced regions of the cDNA's or come from entirely 
different proteins. Several other peptides, like peptide 

r 

701 (SEQ ID NO: 14), are slightly different from the 1 
20 predicted protein sequence, see Fig. 2. This may indicate i 

the presence of a gene family for the thioesterase. 

A second 580 bp DNA fragment obtained through the cDNA 

library screen may also provide evidence of a gene family. 

This sequence shows approximately 80% sequence identity 
25 with the clone at the DNA level described above (Fig. 3) . ; 

The sequence in the upper line (SEQ ID NO: 35) represents ; 

the clone described above and the lower sequence line (SEQ 

ID NO: 36) represents the second 580 bp fragment fragment. 

At the amino acid level more degeneracy is seen. 



/ o 

Sequence (SEQ ID NO: 38) of the full length 
thioesterase cDNA clone, 3A-17, is presented in Fig. 4B. 
The translated sequence (SEQ ID NO: 37) of this clone is 
presented in Fig. 4A. The exact "start" methionine has not 
been determined, but is one of the methionines located at 
positions 1, 5 and 13 of the translated amino acid 
sequence. 

Example IS - Isolation of a Saf flower C-I8 Preferring Acyl- 
ACP Thioesterase cDNA 

Sequence information from cyanogen bromide peptide 

sequences (SEQ ID NOS: 26-31) of the safflower 34 and 40 kD 

protein bands of Table 9 from Example 14B is analyzed to 

obtain a peptide map of the safflower thioesterase protein. 

Comparison of the molecular masses (as estimated by SDS- 

PAGE) of peptides having common amino acid sequences is 

used to determine the order and distance between these 

peptides in the thioesterase protein. Homology comparisons 

of these peptides to the amino acid sequence of the bay 

thioesterase (Fig. 4B) confirms the peptide map shown in 

Fig. 5. Numbers between peptide sites indicate estimated 

base pair separation on a thioesterase cDNA for sequences 

which correspond to the S828, S829, S830 and S834 peptide 

sequences . 

Degenerate oligonucleotide primers for PGR are 
designed from amino acid sequences of safflower 
thioesterase peptide fragments S828, S829, 5830 and S834. 
The S830-derived oligonucleotide mixture, 830, is used as 
forward primer (binds to antisense strand and primes 
synthesis of sense thioesterase DNA) and the S829 
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oligonucleotide mixtures, 829-lR and 829-2R, are used as 
reverse primers (bind to sense strand and prime synthesis 
of antisense thioesterase DNA) in PGR reactions utilizing 
saf flower seed cDNA (from cDNA library described below) as 
5 template . 

Oligonucleotide mixture 830 contains all possible 
sequences that could encode amino acids 5-11 of peptide 
S830, except that the codon chosen for the glycine at 
position 5 is GGC, the codon ACC is chosen for the 

10 threonine at position 9 (with an inosine also being 
included at the third base), and only the first two 
nucleotides of the possible codons are included for the 
valine at position 11. S830 also contains non-thioesterase 
sequences at the 5' end which code for an Ncol site to 

15 facilitate cloning of the PGR products. 

Oligonucleotide mixture 829-lR contains all possible 
complements of sequences that could encode amino acids 1-6 
of peptide S829, except that only the first two nucleotides 
of the possible codons are included for the asparagine at 

20 -position 6. 829-lR also contains the complement for a 

methionine codon at the 3' end, as it can be assumed that 
there is a methionine residue at that position which was 
cleaved in the cyanogen bromide digestion, SB29-1R also 
contains non-thioesterase sequences at the 5' end which 

25 code for a Hindlll site to facilitate cloning of the PGR 
products . 

Oligonucleotide mixture 829--2R contains all possible 
complements of sequences that could encode amino acids 19- 
25 of peptide S829, except that an inosine base is included 
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for the third position of the codon for the isoleucine at 
position 22, the codon ACG is chosen for the threonine at 
position 24 (with an inosine also being included at the 
third base) , and only the first two nucleotides of the 
possible codons are included for the histidine at position 
25. 829-2R also contains non-thioesterase sequences at the 
5' end which code for a Hindlll site to facilitate cloning 
of the PCR products. 

Similarly,, oligonucleotide mixtures are designed from 
amino acids 10-16 of peptide S828 (828 is a forward primer 
with BaMiT site sequences at the 5 * end) , amino acids 12-18 
of peptide S834 (834 is a forward primer with Xbal site 
sequences at the 5' end), and amino acids 8-14 of peptide 
S834 (834R is a reverse primer with Sail site sequences at 
the 5 ' end) . 

PCR reactions are run using Tag polymerase and the DNA 
thermo cycler (Perkin/Elmer Cetus) according to 
manufacturer's specifications. Cycle parameters may be 
altered to provide for maximum yield of the thioesterase 
PCR product. 

PCR products are analyzed by agarose gel 
electrophoresis and the expected -SOObp band is observed. 
Oligonucleotides derived from S834 and S828 are used to 
verify that the band represents thioesterase DNA, either by 
further PCR using the S830/S829 PCR product as template, or 
by Southern hybridization of S830/S829 PCR product. DNA 
sequence of the ^-SOObp product is determined to verify that 
the fragment codes for a portion of the saf flower 
thioesterase protein. 
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The -800bp thioesterase fragment is labeled with 32p 
and. used as a probe to screen a saf flower cDNA library 
constructed in the plasmid cloning vector, pCGN1703. The ] 

i 

cDNA library is constructed from poly (A) + rna isolated from | 

I 

5 safflower embryos harvested at days 14-17 post-anthesis by ? 

■* i 

a method initially described by Jackson and Larkins (Plant t 
Physiol. (1976) 57:5-10) as modified by Goldberg et al. 
{Developmental Biol. (1981) 83:201-217). 

The polyadenylated RNA is used to construct a cDNA 
10 library in the plasmid cloning vector pCGN1703, as t 

described in Example 14. C. 3. The safflower embryo cDNA I 

i 

bank obtained in this manner contains between approximately L 
3-5x10^ clones with an average cDNA insert size of I 
approximately 1000 base pairs. 

15 . 

Example 16 - Expression of Medium-Chain Preferring Acyl-ACP 
Thioesterase In E. coli 

In this example, expression of bay thioesterase 

proteins in E. coli is described. . f 

20 The truncated Bay (1200 bp) cDNA described in Example ; 

14 is expressed as a 30 kD protein in an E. coli host cell 
and data is provided demonstrating that the cDNA fragment 
confers upon the transformant an increased C12 acyl-ACP 
thioesterase activity. 

25 A pET3a vector (Rosenberg, et al.. Gene (1987) 55:125- 

135) is used in an i;. coli strain BL21 (PE3) (Studier and 
Moffat, J. Mol. Biol. (1986) 185:113-130) host for this ■ 
study. The pET3a vector contains a promoter and 33 bp of 
the 5* reading frame of bacteriophase T7. T7 polymerase is 
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under the regulatory control of an isopropyl-b-D- 
thiogalactopyranoside (IPTG) -inducible lac UV5 promoter 
found in the E. coli BL21 (DE3) strain. Thus, by the 
addition of IPTG to E. coli BL21 (D'E3) transformed with 
pET3a, the T7 promoter will be activated • 

Constructs are prepared containing the truncated cDNA 
of Fig. 2 fused in reading frame by deletion of the 
BajnHI/EcoRI fragment and replacement of the thioesterase 
sequence. E. coli are transformed with pETSa constructs 
containing the thioesterase (pET3a-THI0) and unmodified 
pET3a as a control. The E. coli are grown at 37**C in 
liquid medium and expression is induced by the addition of 
ImM IPTG. After 1 hour induction, cells are harvested by 
centrifugation, resuspended in assay buffer and lysed by 
sonication. Cell debris is removed by further 
centrifugation and the supernant used in activity assays as 
per Example 1 . 



Hydrolysis Activity 
(mean cpm in 

En coli Lysate Assay Substrate ether extracts 

pET3a 8:0-ACP 370 

" 10:0-ACP 787 

12:0-ACP 1028 

14;0-ACP 1271 

" 16:0-ACP 2848 

" 18:1-ACP 2877 

pET3a-THI0 8:0-ACP 349 

" 10:0-ACP 621 

" 12:0-ACP 2127 

14:0-ACP 1035 

16:0-ACP 1900 

18:1-ACP 2025 
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The results demonstrate that a lysate of control E. 
coli cells contains hydrolytic activity towards all the ; 
acyl-ACP substrates that were tested, with preference for 
the long-chain substrates. Comparing the pETSa-THIO g 
5 results with the control results it is evident that the J 
pattern of substrate preferences differs. The transformant ^ 
lysate shows greatly increased activity with 12:0-ACP in 7- 
relation to the other substrates, as compared with the 
control lysate. This increased 12:0-ACP activity 

10 demonstrates that this cDNA fragment comprises sufficient |= 
of the the Bay 12:0-ACP thioesterase gene to produce active ^ 
enzyme in E. coli cells. L 
In addition, the entire mature bay thioesterase I- 
protein is expressed as a lac fusion in E. coli cells. 

15 Sequence analysis of the full length bay thioesterase cDNA, 
pCGN3822, described in Example 14, reveals an Xbal site at 
base 394. Digestion at this Xbal site cleaves the coding 
region inimediately 5' of the codon representing the leucine 

r 

at amino acid position 72. This leucine has been 1 
20 identified as a candidate for. the amino terminal residue as L 
described in Example 14A. 

An approximately 1200 bp fragment of pCGN3822 cDNA is 
generated by digestion with Xbal, which cuts at the 
postulated mature protein start site, as described above, 
25 and in the vector sequences flanking the 3' end of the j 
cDNA. The Xbal fragment is cloned on Xbal digest of the i 
minus version of a Bluescribe M13(+/-) (also called pBS+/-) ' 
cloning vector (Stratagene; San Diego, CA.). The 
thioesterase gene clone is inserted such that the* mature 
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protein is in reading frame with a portion of the lacZ gene 
of the Bluescribe vector and under control of the lac 
promoter . 

The resulting construct, pCGN3823, and a control 
5 Bluescribe construct having the bay thioesterase gene 

inserted in the opposite orientation are transformed into 
E. coll. The E. colx cells are grown at 37^0 in liquid 
medium and expression from the lac promoter is induced by 
addition of IPTG to a final concentration of O.lmM IPTG. 
10 Following one hour of induction, cells are harvested, lysed 
and assayed as described above for the truncated bay 
thioesterase . 



15 



20 



25 



30 



Induced 
£. coii 
Lysate 

PCGN3823 
•I 



Pilution 

1/4000 



control 



1/4000 



Assay 
Substrate 

8:0-ACP 
10:0-ACP 
12:0-ACP 
14:0-ACP 
16:0--ACP 
18:1"ACP 

8 : 0-ACP 
10:0-ACP 
12 : 0-ACP 
14:0-ACP 
16: 0-ACP 
18 : 1-ACP 



Hydrolysis Activity 
(mean cpm in 
ether extracts 

0 

0 

1840 
116 
20 
5 

0 
0 
0 
0 
13 
6 



The results demonstrate that a lysate from E. coll 
cells expressing the postulated mature bay thioesterase 
enzyme has significantly greater activity towards a 12:0- 
ACP substrate than towards other ACP substrates of varying 
carbon chain length. In addition, this activity is more 
35 than two orders of magnitude greater than that in a lysate 
of £. coll cells expressing the truncated bay thioesterase. 



studies are being conducted to determine if expression of 
the bay thioesterase protein in E. call cells has an effect 
on the fatty acid composition of these cells. Initial 
studies have failed to identify a substantial change in the 
fatty acid composites of the E. coll cells containing the 
bay thioesterase. 

Substrate analysis of the bay thioesterase enzyme 
purified from developing seeds as described in Example 2 is 
also conducted. Results are presented in Table 12 below. 

T^bl^ 1? 

Hydrolysis Activity 
(mean cpm in) 
Assay Subf?r.rarfi Ether Entrant ' 

8:0-ACP 0 

10 : O-ACP 0 

12:0-ACP 1261 

14:0-ACP 69 

20 16:0-ACP 12 

18 : 1-ACP 432 



10 



15 



Comparison of the results of substrate analysis of the 
thioesterase in the E. coli extracts and as purified from 

25 developing bay seeds reveals that the activity profile of 
the enzyme from the two sources is essentially identical 
with respect to activity with 10, 12, 14, and 16 ACP 
substrates. Although the enzyme purified from embryos is 
slightly more active with C18:l substrates than is the 

30 coli-expressed thioesterase, this difference is believed 
due to activity of a long chain bay thioesterase which is 
not completely removed from the medium chain thioesterase 
protein preparation. 
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Example 17 - Transformation with Plant Thioesterase 

A. Constructs for expression of bay thioesterase in 
plant cells which utilize phaseolin, napin, CaMV35S and 
Bce4 promoter regions are prepared as follows. 

A 1.45kb fragment of 3A-17 is obtained by digestion 
with Bail and Sail. The Bail site is located at position 
149 of the cDNA insert, and the Sail site is in the 
poly linker located 3' to the cDNA insert. Thus, this 
10 fragment contains the entire thioesterase coding region and 
the entire cDNA 3' region, including the polyadenylation 
signal, AAATl^, located at bases 1447-1452, and also 
contains the restriction digestion sites Kpnl, Smal, Xbal 
and Sail located directly 3' to the cDNA. 
15 An 850bp Bglll fragment of the fl-phaseolin 5 ' 

noncoding region was obtained from p8.8pro (Hoffman et al. 
(1987) EMBO J. 5:3213-3221) and cloned into pUC9 (Vieira 
and Messing, supra) at the BamHI site to yield pTV796. The 
phaseolin fragment in pTV796 is oriented such that Smal 
20 site of pUC9 is located 3' to the phaseolin promoter. An 
-850bp fragment is generated by digestion of pTV7 96 with 
flij7dIII and Sinal and gel-purified. 

The phaseolin promoter (Hii3dIII/SjnaI) and thioesterase 
coding region {Ball/ Sail) are joined by three way ligation 
25 into a Bluescript (Stratagene) cloning vector that has been 
digested with Hindlll and Sail. The resulting plasmid 
contains the phaseolin promoter/thioesterase construct on a 
flii^dlll/Sall fragment that is flanked by various 
restriction sites, including a 5' BaniRl site and a 3' Kpnl 
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site. No additional plant 3' noncoding region is provided 
as the thioesterase fragment contains a polyadenylation 
signal. The phaseolin promoter/thioesterase fragment may 
be obtained by digestion with 5a;nHI and Kpnl, or 
5 alternatively by partial digestion with Xbal, and ligated 
into an appropriate binary vector, such as pCGN1557 or 
pCGN1578 (McBride and Summerfelt, (1990) Plant Mol. Biol. 
14:269-216) f for plant transformation. 

10 An Ball/Pstl fragment of the thioesterase cDNA 3A-17 

containing approximately 1200bp, and including the entire 
coding region, is obtained by partial digestion with 
restriction enzymes jBall and PstI and gel-purification of 
the 1200bp fragment. The fragment is ligated into a 

15 plasmid cloning vector, such as a Bluescript vector 

(Stratagene Cloning Systems; La Jolla, CA) , that has been 
digested with PstI and BamHI, and the BamHI site filled in 
using the Klenow fragment of DNA Polymerase I. In this 
procedure, the BamHI site is restored by ligation to the 

20 Ball site of the thioesterase cDNA. 

The resulting plasmid is partially digested with BamHI 
and £:coRI to obtain the approximately 12d0bp thioesterase 
fragment. This fragment is then cloned into an 
approximately 4.4kb BamHI/BcoRI DNA fragment which contains 

25 approximately 0.94kb of 5' noncoding sequence from a 

cauliflower mosaic (CaMV) 35S gene (immediately 5' to the 
SamHI site), approximately 0.77kb of 3' noncoding sequence 
from an Agrobact era urn tumef a clens manopine synthase (mas) 
gene (immediately 3' to the JScoRI site), and a pUC19 (New 
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England BioLabs, Beverly, MA) backbone. The Bainai / EcdRl 
DNA fragment is obtained by partial digestion of a larger 
plasraid vector and gel purification of the desired 4.4kb 
fragment. The 35S 5' region is from bases 6492 to 7433 of | 
5 strain CM1841 (Gardner, et al. (1981) Nucl. Acids Res. 
5:2871-2887), which is from about -640 to about +2 in 
relation to the transcription start site. The mas 3' 
noncoding region is from about bases 19,239 to 18,474 of 
octopine Ti plasmid pTiA6 (numbering corresponds to that of 

10 closely related pTil5955 as reported by Barker et ai. k 

i- 

(Plant Mol. Biol. (1983) 2:335-350)). ^ 
The resulting 35S/thioesterase/mas plasmid is digested L 
at flanking BgJII sites and cloned into a BamHI digested 
binary vector, such as pCGN1557 or pCGN1578 (McBride and 
15 Summer felt, supra) . 

A 1.45kb thioesterase cDNA Ball/ Sail fragment is 
prepared as described above. A Bce4 expression cassette, ^ 
pCGN1870, which provides for preferential expression in 
20 early seed development is described in copending US Patent L 
Application Serial No. 07/494,722, which is incorporated 
herein by reference. 

An approximately Ikb fragment of the Bce4 5' noncoding 
region whose 3 ' end is immediately 5 ' to the Bce4 start 
25 codon, is obtained by digestion of pCGN1870 with Xbal and 2 
Xhol and gel purification of the resulting Ikb fragment. J 

The Bce4 promoter (Xbal/Xhol) and thioesterase coding 
region (BaII/5aII) are joined by three way ligation into a 
Bluescribe (Stratagene) cloning vector that has been 



digested with Xhal and Sail. The resulting plasmid 
contains the Bce4 promoter/thioesterase construct on a 
Xbal/Sall fragment that is flanked by various restriction 
sites ^ including a 5' BamRl site and a 3' Kpnl site. No 
5 additional plant 3 ' noncoding region is provided as the 
thioesterase fragment contains a polyadenylation signal • 
The Bce4 promoter/thioesterase fragment may be obtained by 
digestion with Ba/nHI and partial digestion with iiTpnl (or 
Asp718 which has the same recognition sequence) , or 

10 alternatively by partial digestion with Xbal, and ligated I 

f 

into an appropriate binary vector, such as pCGN1557 or | 

r 

PCGN1578 (McBride and Summerfelt, supra) ^ for plant 

transformation. ; 

15 The napin expression cassette, pCGN1808, is described 

in copending US Patent Application serial number 
07/550,804, which is incorporated herein by reference, 
pCGN1808 is modified to contain flanking restriction sites 
to allow movement of only the expression sequences and not ] 

20 the antibiotic resistance marker to binary vectors such as ^ 
pCGN1557 (McBride and Sximmerfelt, supra) . Synthetic 
oligonucleotides containing Kpnl, Notl and Hindlll 
restriction sites are annealed and ligated at the unique 
flindlll site of pCGN1808, such that only one fiindlll site 

25 is recovered. The resulting plasmid, pCGN3200 contains 

unique Hindlll, Notl and iCpnl restriction sites at the 3'- , 
end of the napin 3 ' -regulatory sequences as confirmed by ^ 
sequence analysis. 
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The majority of the napin expression cassette is 
subcloned from pCGN3200 by digestion with flindlll and Sad 
and ligation to Hindlll and Sad digested pIC19R (Marsh, et 
al. (1984) Gene 52:481-485) to make' pCGN3212. The extreme 
5 5 '-sequences of the napin promoter region are reconstructed 
by PGR using pCGN3200 as a template and two primers 
flanking the Sad site and the junction of the napin 5*- 
promoter and the pUC backbone of pCGN3200 from the pCGN1808 
construct. The forward primer contains Clal, flindlll, 

10 WotI, and Kpnl restiction sites as well as nucleotides 408- 
423 of the napin 5 '-sequence (from the EcoRV site) and the 
reverse promer contains the corrplement to napin sequences. 
718-739 which include the unique Sad site in the 5'- 
promoter. The PGR was performed using in a Perkin 

15 Elmer/Cetus thermocycler according to manufacturer's 

^ specifications. The PGR fragment is subcloned as a blunt- 
ended fragment into pUC8 (Vieira and Messing (1982) Gene 
15:259-268) digested with Hindi to give pCGN3217. 
Sequenced of pCGN3217 across the napin insert verifies that 

20 no improper nucleotides were introduced by PGR. The napin 
S-sequences in pGGN3217 are ligated to the remainder of the 
napin expression cassette by digestion with Clal and Sad 
and ligation to pCGN3212 digested with Clal and Sad. The 
resulting expression cassette pCGN3221, is digested with 

25 fiindlll and the napin expression sequences are gel purified 
away and ligated to pIC20H (Marsh, supra) digested with 
flindlll. The final expression cassette is pGGN3223, which 
contains in an ampicillin resistant background, essentially 
identical 1.725- napin 5' and 1.265 3' regulatory sequences 



as found in pCGN1808 • The regulatory regions are flanked 
with HindlUr Notl and Kpnl restriction sites and unique 
Sail, Bglllf Pstlf and Xhol cloning sites are located 

between the 5' and 3' noncoding regions. - 
5 The 1200bp Ball/Pstl thioesterase cDNA fragment 

described above is cloned into the napin expression r 
cassette, pCGN3223, which has been digested with Sail, and 
the Sail site filled in using the Klenow fragment of pNA 
Polymerase I, followed by digestion with PstI, The Sail 

10 site is reconstituted in this ligation. L 
The napin/thioesterase/napin plasmid generated by ^ 
these manipulations is digested with BajnHI and partially L 
digested with Kpnl to generate an approximately 3.3kb I 
fragment. This fragment contains -1.7kb of napin 5' 

15 noncoding sequence, the -'1200bp Ball/Pstl thioesterase cDNA 
fragment and --O.SSkb of 3' napin noncoding region, the rest 
of the 1 .265^3 of the napin 3' having been deleted due to 
the BaiMl site in this region. The -'3.3kb fragment is 

i 

ligated to Kpnl/BaitiEl digested pCGN1557 or pCGN1578 \ 
20 (McBride and Summerfelt, supra) for plant transformation. L 
B. A variety of methods have been developed to insert a 
DNA sequence of interest into the genome of a plant host to obtain 
the transcription or transcription and translation of the sequence 
to effect phenotypic changes . 
25 gyflggjg^ TyangfQrinatAQn 

Seeds of Brass ica napus cv, Westar are soaked in 95% i 
ethanol for 2 min. surface sterilized in a 1,0% solution of ^ 
sodium hypochlorite containing a drop of Tween 20 for 45 
min., and rinsed three times in sterile, distilled water. 



Seeds are then plated in Magenta boxes with 1/lOth 
concentration of Murashige minimal organics medium (Gibco; 
Grand Island, NY) supplemented with pyriodoxine (50p.g/l) , 
nicotinic acid (50|lg/l) , glycine (2D0|Xg/l) , and 0,6% 
Phytagar (Gibco) pH 5.8* Seeds are germinated in a 
Percival chamber at 22®C. in a 16.h photoperiod with cool 
fluorescent and red light of intensity approximately 65[L 
Einsteins per square meter per second (|iEm~2s~^) . 

Hypocotyls are excised from. 5-7 day old seedlings, cut 
into pieces approximately 4mm in length, and plated on 
feeder plates (Horsch et al., Science (1985) 227:1229- 
1231) . Feeder plates are prepared one day before use by 
plating 1.0ml of a tobacco suspension culture onto a petri 
plate (100x25mm) containing about 30ml MS salt base 
(Carolina Biological, Burlington, NC) lOOmg/1 inositol, 
1.3mg/l thiamine-HCl, 200mg KH2PO4 with 3% sucrose, 2,4-D 
(l.Omg/1), 0.6% w/v Phytagar, and pH adjusted to 5.8 prior 
to autoclaving (MS 0/1/0 medium) • A sterile filter paper 
disc (Whatman 3mm) is placed on. top of the feeder layer 
prior to use. Tobacco suspension cultures are subcultured 
weekly by transfer of 10ml of culture into 100ml fresh MS 
medium as described for the feeder plates with 2,4-D 
(0.2mg/l), Kinetin (O.lmg/1). In experiments where feeder 
cells are not used hypocotyl explants are cut and placed 
onto a filter paper disc on top of MSO/1/0 medium. All 
hypocotyl explants are preincubated on feeder plates for 24 
h. at 22®C in continuous light of intensity 30|lEm"2s'"l to 
65[lEM'"2s-l . 



Single colonies of A. tumef aciens strain EHA 101 
containing a binary plasmid are transferred to 5ial MG/L 
broth and grown overnight at 30°C. Hypocotyl explants are : 
immersed in 7 -12ml MG/L broth with bacteria diluted to 
5 1x10^ bacteria/ml and after 10-25 min. are placed onto 

feeder plates. Per liter MG/L broth contains 5g mannitol, r 
Ig L-Glutaraic acid or 1.15g sodiuiti glutamate, 0.25g ICH2PO4, 
O.lOg NaCl, O.lOg MGSO4-7H20, Img biotin, 5g tryptone, and 
2.5g yeast extract, and the broth is adjusted to pH 7.0. 
10 After 48 hours of co-incubation with Agrobacteriu/n, the L 

hypocotyl explants are transferred to B5 0/1/0 callus \ 

% 

induction medium which contains filter sterilized 
carbenicillin (500mg/l, added after autoclaving) and 
kanamycin sulfate (Boehringer Mannheim; Indianapolis, IN) 
15 at concentrations of 25mg/l, 

After 3-7 days in culture at 65}iEM"*2s-l continuous 
light, callus tissue is visible on the cut surface and the 
hypocotyl explants are transferred to shoot induction 

i 

medium, B5BZ (B5 salts and vitamins supplemented with 3mg/l ; 

20 benzylaminopurine, lmg/1 zeatin, 1% sucrose, 0.6% Phytagar 
and pH adjusted to 5.8) . This medium also contains 
carbenicillin (500mg/l) and kanamycin sulfate (25mg/l) . 
Hypocotyl explants are subcultured onto fresh shoot 
induction medium every two weeks • 

25 Shoots regenerate from the hypocotyl calli after one 

to three months. Green shoots at least 1cm tall are ] 
excised from the calli and placed on medium containing B5 ' 
salts and vitamins, 1% sucrose, carbenicillin (300mg/l) , 
kanamycin sulfate (50rag/l) and 0.6% w/v Phytagar). After 
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2-4 weeks shoots which remain green are cut at the base and 
transferred to Magenta boxes containing root induction 
medium (B5 salts and vitamins, 1% sucrose, 2mg/l 
indolebutyric acid, 50mg/l kanamycih sulfate and 0.6% 
5 Phytagar) . Green rooted shoots are tested for thioesterase 
activity. 

Peanut T ransformation 

DNA sequences of interest may be introduced as 
expression cassettes, comprising at least a promoter 
10 region, a gene of interest, and a termination region, into 
a plant genome via particle bombardment as described in 
European Patent Application 332 855 and in co-pending 
application USSN 07/225,332, filed July 27, 1988, 

Briefly, tungsten or gold particles of a size ranging 
15 from 0.5^M-3|LtM are coated with DNA of an expression 

cassette. This DNA may be in the form of an aqueous 
mixture or a dry DNA/particle precipitate. 

Tissue used as the target for bombardment may be from 
cot y ledonary explants,' shoot meristems, immature leaflets, 
20 or anthers. 

The bombardment of the tissue with the DNA-coated 
particles is carried out using a Biolistics*'''^ particle gun 

(Dupont; Wilmington, DE) • The particles are placed in the 
^ barrel at variable distances ranging from lcm-14cm from the 

25 barrel mouth. The tissue to be bombarded is placed beneath 
the stopping plate; testing is performed on the. tissue at 
distances up to 20cm. At the moment of discharge, the 
tissue is protected by a nylon net or a combination of 
nylon nets with mesh ranging from 10\m to 300JIM. 



Following bombardment/ plants may be regenerated 
following the method of Atreya, et al., (Plant Science 
Letters (1984) 34:379-383) . Briefly, embryo axis tissue or 

! 

cotyledon segments are placed on MS medium (Murashige and | 
5 Skoog, Physio. Plant. (1962) 15:473) (MS plus 2.0 mg/1 6- • : 
benzyladenine (BA) for the cotyledon segments) and ^ 
incubated in the dark for 1 week at 25 ± 2°C and are 
subsequently transferred to continuous cool white 
fluorescent light (6.8 W/m^) . On the 10th day of culture, 

10 the plant lets are transferred to pots containing sterile i 
soil, are kept in the shade for 3-5 days are and finally | 
moved to greenhouse. 1 
The putative transgenic shoots are rooted. ^ 
Integration of exogenous DNA into the plant genome may be 

15 confirmed by various methods know to those skilled in the 
art . 

Example 18 - Obtaining- Other Plant Thioesterases 

Having obtained sequence (amino acid and DNA) for Bay I 

20 thioesterase, similar genes from other plant sources can be : 
readily isolated. In this example, two methods are 
described to isolate other thioesterase genes: (A) by DNA 
hybridization techniques' using sequences or peptide 
sequence information from the Bay thioesterase gene and (B) 

25 by immunological cross-reactivity using antibodies to the 
Bay protein as a probe. 

In either of these techniques, cDNA or genomic 
libraries from the desired plants are required. Many 
methods of constructing cDNA or genomic libraries are 
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provided for example in Chapter 8 and 9. of Maniatis, et al. 
{Molecular Cloning: A Laboratory Manual, Second Edition 
(1989) Cold Spring Harbor Laboratory, Cold Spring Harbor, 
New York) . The method described in Example 14 can also be 
5 used for cDNA library construction. 

A. Probes for use in DNA hybridizations to isolate 
other plant thioesterase genes can be obtained from the Bay 
thioesterase gene sequences provided or alternatively by 
PCR using oligonucleotides from the Bay thioesterase 
10 peptide sequence provided. 

In this example, the 800 bp PCR-generated DNA fragment 
is used as a probe. Northern analysis of embryo RNA from 
the desired plant species is conducted to determine 
appropriate hybridization conditions . RNA is isolated from 
15 embryo as described in Example 14. C, electrophoresed in a 
formaldehyde/agarose gel and transferred to a nylon 
membrane filter as described by Fourney, efc al. (Focus 
(1988) Bethesda Research Laboratories/Life Technologies, 
Inc., 10:5-7) . The 32p-iabeled probe (Random Primed DNA 
20 labeling kit, Boehringer Mannheim, Indianapolis, IN) is 

added to a hybridization solution containing 50% formamide, 
6 X SSC (or 6 X SSPE) , 5 x Denhardt's reagent, 0.5% SDS, 
and 100|Xg/ml denatured salmon sperm DNA fragments. 

^ The hybridization solution containing the labeled 

25 probe- is incubated with the Northern filter at 

approximately 40**C for 18 hours or longer to allow 
hybridization of the probe to homologous (50-80%) 
sequences. The filter is then washed at low stringency 
(room temperature to 42*'C in about IX SSC) . 



Hybridization and washing temperatures may be adjusted 
based on the estimated melting temperature of the probe as 
discussed in Beltz, et al. {Methods in Enzymology (1983) 
100:266-285) . In further testing the temperature is raised 
5 either in the hybridization or washing steps ^ and/or salt 
content is lowered to improve detection of the specific 
' hybridizing sequence. 

A useful probe and appropriate hybridization and 
washing conditions having been identified as described 

10 above^ cDNA libraries are screened using the ^^p-i^beled 
fragment and optimized conditions. 

For example, a camphor {Cinnamomum camphora) 
thioesterase clone may be isolated using nucleic acid and 
amino acid sequence information from the bay and saf flower 

15 thioesterase clones. Homology of the bay thioesterase cDNA 
clone to RNA isolated from developing camphor embryos is 
observed by Northern analysis as follows. Total RNA is 
isolated from Ig of developing camphor embryos by 
adaptation of the SDS /phenol extraction method described in 

20 Current Protocols in Molecular Biology r pages 4.3.1-4.3.4 
(Ausubel et al., eds. (1987); John Wiley & Sons). The 
grinding buffer for this extraction contains lOOmM LlCl, 
lOOmM Tris pH9, lOmM EDTA, 1%SDS and 0.5% fi- 
mercaptoethanol . For extraction from Ig of embryos, 10ml 

25 of grinding buffer plus 3ml of phenol equilibrated to pH8 
are added to powdered embryos. The homogenization step may 
be conducted in a mortar instead of with a polytron, as 
described in the published method, and the heating step 
which follows homogenization in that method is omitted. 
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Centrifugation, phenol/chloroform extractions of the sample 
and LiCl precipitation of RNA are as described. 
Total RNA (10-20jlg) is electrophoresed in a 

formaldehyde/agarose gel and transferred to a nylon 
membrane filter as described by Fourney at al, (supra) , A 
probe for hybridization of the Northern filter is prepared 
from a 5aII digest of pCGN3822, the full length bay 
thioesterase cDNA by PGR using oligonucleotides to the 
saf flower thioesterase cDNA sequence to generate an 
approximately 1300bp fragment. The forward primer contains 
nucleotides 212 to 228 of the safflower thioesterase cDNA 
sequence (SEQ ID NO: 38) and the reverse primer is the 
complement to nucleotides 1510-1526 of the cDNA sequence. 
The fragment is gel purified using a Prep-A-Gene DNA 
purification kit (BioRad; Richmond, CA) and radiolabeled 
using a Boehringer Mannheim (Indianapolis, IN) random 
priming labeling kit. The Northern filter is hybridized 
overnight in 50% formamide, 5X SSC, 50mM sodium phosphate 
(pH7), 5X Denhardt-Vs solution, 0.1% SDS, 5mM EDTA and 
O.lmg/ml denatured DNA at 30^C. The filter is washed twice 
(15 minutes each wash) in O.IX SSC, 0.1% SDS. 
Autoradiography of the hybridized filter reveals a strong 
hybridization signal to an approximately 1300bp RNA band in 
the camphor embryo sample. This band is approximately the 
same size as the bay thioesterase mRNA. 

To obtain a fragment of the camphor thioesterase gene, 
PGR is conducted using oligonucleotides to peptides 
conserved between the bay and safflower thioesterases . The 
forward primer, 828-1-2800, is a mixture of all possible 
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coding sequences for amino acids 101-107 of the bay 
thioesterase protein (SEQ ID. NO: 37) and amino acids 16-22 
of saf flower thioesterase peptide S828 (SEQ ID NO:26) . j 
These amino acid sequences are idei^itical if one assumes the - 
5 unknown amino acid at position 18 of S828 is a serine. The 

primer (28bp) is designed such that an inosine (I) or t~ 
cytosine (C) is incorporated where the base could be any of 
A, G, T or C. The unknown third base of the final amino 
acid, glycine, is not incorporated into the oligonucleotide 
10 sequence, and the sequence CTGGATCC is added at the 5' end ^ 

3- 

to include a BaxriEl restriction site. The reverse primer, |-. 
829-2a-2798, is a mixture of all possible complements to _ 
the coding sequences for amino acids 271--276 of the bay 
thioesterase protein (SEQ ID NO: 37) and amino acids 1-6 of 
15 saf flower thioesterase peptide 3829 (SEQ ID NO: 27) . These 
6 amino acid peptide regions differ only in their second 
amino acid (this residue having been identified as a 
glut amine in the bay and a lysine in the saf flower 

I 

peptides) and both amino acid codons are represented in the ! 
20 oligonucleotide mixture. The reverse primer (26bp) [_ 

incorporates an I or a C where the base could be A, G, T or 

C, and the sequence GCCTCGAG is added at the 5' end to add 

an Xhol restriction site. 

Polymerase chain reactions are conducted using reverse 
25 transcribed camphor RNA as template. The reactions are 

conducted in a Biosycler Oven (Bios Corp,; New Haven, CT) 

programmed for the following cycles: 



N 



950c for 2 min. 



P 



95^C for 15 sec. 



1 sec. drop to 65*^C 
hold 65°C for 1 sec. 

2 min. drop to 45<^C 
hold 45<^C for 30 sec. 
1 sec. rise to 72^0 
hold 72^0 for 30 sec. 
1 sec. rise to 95^C 



hold 55^0 for 15 sec. 
. 1 sec. rise to 72^0 
hold 72^0 for 15 sec. 
1 sec. rise to 95^C 



1 sec. drop to 65^C 
hold 65^C for 1 sec. 

2 min, drop to 55^C 



Cycle N is run and repeated 6 times after which cycle P is 
run and repeated 37 times. 

An approximately 500-600bp band is identified by 
agarose gel electrophoresis of the PGR products. This is 
the approximate fragment size predicted from analysis of 
the distance between the peptides in the bay thioesterase 
sequence. The PGR fragment is subcloned into an 
appropriate cloning vector and its DNA sequence determined 
to verify thioesterase sequence. The fragment can then be 
utilized to screen a camphor cDNA or genomic library to 
isolate a camphor thioesterase clone. 

B. For immunological screening, antibodies to the 
Bay thioesterase are prepared by injecting rabbits or mice 
with the thioesterase protein purified from Bay or with the 
truncated thioesterase protein expressed in E. coli as 
described Example 16. 

Genes encoding related proteins are isolated by 
screening the cDNA library from the plant of Interest that 
has been transferred to the expression vector lambda gtll, 
described in Chapter 12 of Maniatis,r et al. (supra) . The 
libraries are then plated, induced to produce proteins from 
the cloned genes, and lifted onto membranes to immobilize 
for screening. The thioesterase antibodies are supplied to 
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the filters containing immobilized proteins to allow 
binding of the antibody to related proteins. Clones 
encoding thioesterase proteins are identified by detection 
of the antibody /protein complex on 'the nitrocellulose | 
5 filters using a secondary antibody/enzyme conjugate system ] 
utilizing alkaline phosphate as described by Oberfelder r 
(Focus (1989) BRL/Life Technologies, Inc. 11:1-5). 
An^lygj-g; 

Clones identified using DNA hybridization or 

10 immunological screening techniques are then purified and U 

h 

the DNA isolated using techniques as provided m Maniatis, 'r 
et al. {supra) . DNA sequence of the genes is determined as L 
described in Examples 14 and 15. In this manner, it is 
verified that the clones encode a related thioesterase. 
15 Alternatively, the protein is expressed in E. coli as 

described above for the Bay thioesterase to show that it 
has the desired activity. The newly isolated plant 

thioesterase sequences can also be used to isolate genes • ^ 
for thioesterases from other plant species using the ! 
20 techniques described above. L 

By the above examples, demonstration of critical 
factors in the production of long-chain and medium-chain 
fatty acids is described- A protocol is provided to obtain 
25 partially purified C12-preferring acyl ACP thioesterase : 
from the California Bay, various properties of the protein 
are described including methods to obtain and use amino ' 
acid and nucleic acid sequence related thereto. A partial 
cDNA sequence of the Bay thioesterase is also provided with 
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a demonstration of the activity of the polypeptide encoded 
thereby. A full sequence of the Bay thioesterase is also 
given with various constructs for use in host cells . In 
addition, methods to obtain a partially purified 
preparation of a ClO-preferring acyl-ACP thioesterase from 
Cuphea hookeriana is also provided. A long-chain 
preferring acyl-ACP thioesterase from safflower is also 
described. Through this invention, one can obtain the 
amino acid and nucleic acid sequences which encode plant 
-thioesterases from a variety of sources and for a variety 
of applications . 

All publications and patent applications mentioned in 
this specification are indicative of the level of skill of 
those skilled in the art to which this invention pertains. 
All publications and patent applications are herein 
incorporated by reference to the same extent as if each 
individual publication or patent application was 
specifically and individually indicated to be incorporated 
by reference. 

Although the foregoing invention has been described in 
some detail by way of illustration and example for purposes 
of clarity of under stsmding, it will be obvious that 
certain changes and modifications may be practiced within 
the scope of the appended claims. 
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SEQUENCE LISTING 
5 (!) GENERAL INFORMATION: 

(i) APPLICANT: Davies, Huw Maelor 

Pollard/ Michael Roman 
Voekler, Toni Alois 
10 Thompson, Gregory A 

(ii) TITLE OF INVENTION: Plant Thioesterases 

(iii) NUMBER OF SEQUENCES: 40 

15 

(iv) COHHESPONDENCE ADDRESS: 

(A) ADDRESSEE: Calgene, Inc. 
20 (B) STREET: 1920 Fifth Street 

(C) CITY: Davis 

(D) STATE: California 

25 

(E) COUNTRY: USA 

(F) ZIP: 95616 
30 (v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette, 3.50 inch, 720 KB storage 

(B) COMPUTER: Apple Macintosh 

35 

(C) OPERATING SYSTEM: Macintosh 6.0.7 

(D) SOFTWARE: Micro softWord 4.0 
40 (vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: unassigned 

(B) FILING DATE: 

45 

(C) CLASSIFICATION: 
(vii) PRIOR APPLICATION DATA: 

50 (A) APPLICATION NUMBER: 07/662,007 

(B) FILING DATE: 27-FEB-1991 

(A) APPLICATION NUMBER: 07/514,030 

55 

(B) FILING DATE: 26-APR-1990 

(A) APPLICATION NUMBER: 07/620,426 
60 (B) FILING DATE: 30-NOV-1990 




(viii) ATTOrReY/AGENT INFORMATION: 

(A) NAME: Elizabeth Lassen 

(B) REGISTRATION NUMBER: 31,845 

(A) NAME: Donna E. Scherer 

(B) REGISTRATION NUMBER: P-34,719 

(C) REFERENCE/DOCKET NUMBER: CGNE 70-3 WO 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 916-753-6313 

(B) TELEFAX: 916-753-'1510 

(2) INFORMATION FOR SEQ ID NO: 1: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

Try Pro Thr Trp Pro Asn Phe Val Leu Xaa Thr Met Leu He Gly Ala 
1 5 10 15 



Gin 



(2) INFORMATION FOR SEQ ID NO: 2: 



(i) SEQUENCE CHARACTERISTICS: 



<A) LENGTH: 



6 amino acids 



(B) TYPE: 



amino acid 



(D) TOPOLOGY: 



linear 



(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 



Asp Leu Met Trp Val Val 
1 5 



(2) INFORMATION FOR SEQ ID NO: 3: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 12 amino acids 



(B) TYPE: amino acid 

(D) TOPOLOGY: linear 
5 (ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Xaa Gly Tyr Asn Pro Xaa Asp lie Pro Phe Val Xaa 
10 1 5 10 



(2) INFORMATION FOR SEQ ID NO: 4: 
15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 amino acids 

(B) TYPE: amino acid 

20 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Thr Xaa Thr Leu Val Asp Val Val Pro Phe Val He Trp Phe Val Phe 
15 10 15 

30 He Asp Asn Val Ala Val Lys 
20 



(2) INFORMATION FOR SEQ ID NO: 5: 

35 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 15 amino acids 
40 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

45 

(xi) SEQXJENCE DESCRIPTION: SEQ ID NO: 5: 

Leu Asn Asp Xaa Thr Ala Asp Tyr He Gin Xaa Xaa Leu Thr Pro Arg 
15 10 15 

50 

(2) INFORMATION FOR SEQ ID NO: 6: 
(i) SEQUENCE CHARACTERISTICS: 

55 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

60 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Ala Gly Gly Trp Val Phe Glu Thr Val Pro Asp Xaa He Phe Glu 
15 10 15 

(2) INFORMATION FOR SEQ ID NO: 7: j 
(i) SEQUENCE CHARACTERISTICS: ^ 

(A) LENGTH; 17 amino acids { 

(B) TYPE: amino acid 

15 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7; 

20 

Asn Glu Thr Gly Val He Xaa Val Val Met Xaa Val Ala Phe Gly Pro T 
1 5 . 10 15 r 

I- 

He ■ t 

25 ^ 
(2) INFORMATION FOR SEQ ID NO: 8: L 

(i) SEQUENCE CHARACTERISTICS: 

30 (A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 

35 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

40 Xaa Val Gly He Leu Gly Asp Gly Phe Gly Thr Thr Leu Glu Met Ser f 
1 5 10 15 \ 



45 



Lys 



(2) INFORMATION FOR SEQ ID NO: 9: 
(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 

55 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

60 Gly He Ser Val He Pro Ala Glu Pro Arg 
1 5 10 



10 



20 



30 



35 



40 



50 



55 



60 



(2) INFORMATION FOR SEQ ID NO: 10: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

15 Leu Asn Asp Xaa Thr Ala Asp Tyr lie Gin Gly Gly Leu Thr Pro 
1 5 . 10 15 



(2) INFORMATION FOR SEQ ID NO: 11: 
(i> SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 17 amino acids 
25 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 



Ser Val Gly He Leu Gly Asp Gly Phe Gly Thr Thr Leu Xaa Met Ser 
1 -5 10 15 

Lys 



(2) INFORMATION FOR SEQ ID NO: 12: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 10 amino acids 
45 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 



Gly He Ser -Val He Pro Ala Glu Pro Arg 
15 10 



(2) INFORMATION FOR SEQ ID NO: 13: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 
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(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Tyr Val Ala Glu Val Phe Glu Thr Val Pro Asp Xaa He Phe 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 14: 
(i) SEQtJENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Ser Thr Asp He Leu Ala Val Met Asn Xaa Met Gin Phe Ala Thr Leu 
1 5 10 15 

Asn Xaa Ala Lys 
20 

(2) INFORMATION FOR SEQ ID NO: 15: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Xaa Xaa He Gly Pro Ala Phe He Asp Asn Val Ala Val Lys 
15 10 

(2) INFORMATION FOR SEQ ID NO: 16: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE; amino acid 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
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Xaa Xaa lie Gly Pro Ala Phe lie Asp Asn Val Ala Val Lys 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 17: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Ser Thr Ser Leu Ser Val Leu Met Asn Thr 

15 10 . 

(2) INFORMATION FOR SEQ ID NO: 18: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Asp Ser lie Phe Glu Ser 
1 5 

(2) INFORMATION FOR SEQ ID NO: 19: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

Asp Tyr He Gin Gly Gly Leu Thr Pro Xaa Trp 
1 5 10 



(2) INFORMATION FOR SEQ ID NO: 20: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 

5 (D)' TOPOLOGY; linear 

(ii) MOLECULE TYPE: peptide 
(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 20: 

10 

Asp Ser Val Leu Xaa Ser Leu Thr Thr Val Xaa Gly Gly Ser Ser Glu 
15 10 15 

Ala 

15 

(2) INFORMATION FOR SEQ ID NO: 21: 
(i) SEQUENCE CHARACTERISTICS: 

20 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

25 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

30 

Asp Thr Val Xaa Val Glu Xaa lie lie Ala Asn Ser 
15 10 



35 (2) INFORMATION FOR SEQ ID NO: 22: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 14 amino acids 
40 • . 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 

45 (ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

Asp Xaa Phe Arg Gly lie Ser Val He Pro Ala Glu Pro Arg 
50 1 5 10 



(2) INFORMATION FOR SEQ ID NO: 23: 
55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

60 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

Asp Ser Phe Arg Gly He Ser He Val Ala Glu Pro Arg 
5 1 5 10 



(2) INFORMATION FOR SEQ ID NO: 24: 
10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 aitiino acids 

(B) TYPE: amino acid 

15 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 



25 



35 



45 



55 



60 



Asp Trp Val He Glu Tyr Arg Pro Gly Val 
15 10 



12) INFORMATION FOR SEQ ID NO: 25: 
(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 



40 Asp His Leu Leu Glu Leu Glu Gly Gly Ser Glu Val Leu Xaa Ala 
15 10 15 



(2) INFORMATION FOR SEQ ID NO: 26: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 27 amino acids 
50 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 



Gly Ser Leu Thr Glu Asp Gly Leu Ser Tyr Lys Glu Val Phe He He 
15 10 15 

Arg Xaa Tyr Glu Val Gly He Asn Lys Thr Ala 
20 25 



TTX-r JW-tAJl 109 

(2) INFORMATION FOR SEQ ID NO: 27: 
(i) SEQUENCE CHARACTERISTICS: 

5 

(A) LENGTH; 37 amino acids 

(B) TYPE: amino acid 

10 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 



15 



45 
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Asn Lys His Val Asn Asn Val Thr Tyr lie Gly Xaa Val Leu Glu Ser 
15 10 15 



He Pro Gin Glu Val He Asp Thr His Glu Leu Gin Thr He Thr Leu 
20 20 25 30 

Asp Tyr Arg Arg Glu 
35 

25 

(2) INFORMATION FOR SEQ ID NO: 28: 
(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 

35 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

40 Ala Val Arg Thr Gly Glu Gin Pro Thr Gly Val Ala Val Gly Leu Lys 
1 5 10 15 



Glu Ala 



(2) INFORMATION FOR SEQ ID NO: 29: 
(i) SEQUENCE CHARACTERISTICS; 
50 (A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 



60 Lys Asp His Ala Ser Gly Gin Val He Gly 
1 • 5 10 
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(2) INFORMATION FOR SEQ ID NO: 30: 
(i) SEQXJENCE CHARACTERISTICS: 

(A) LENGTH: 23 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

Asn Glu Asp Thr Arg Arg Leu Gin Lys Val Asn Asp Asp Val Glu Asp 
1 5 10 15 

Glu Tyr Leu Val Phe He Pro 
20 

(2) INFORMATION FOR SEQ ID NO: 31: 
(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 aiaind acids 

(B) TYPE: amino acid 

<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

His He Glu He Tyr Xaa Tyr Pro Ala 
1 5 

(2) INFORMATION FOR SEQ ID NO: 32: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
CTGGATCCGA YATHYTNGCIN GTNATGAA 28 



(2) INFORMATION FOR SEQ ID NO: 33: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 28 base pairs 



Ill 



55 



(2) INFORMATION FOR SEQ ID NO: 35: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 622 base pairs 

(B) TYPE: nucleic acid 

60 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

5 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: Other nucleic acid, 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 
GCCTCGAGCK NGGYTCNGCN GGRATNAC 28 

15 (2) INFORMATION FOR SEQ ID NO: 34: 
(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 210 base pairs 

20 

(B) TYPE: nucleic acid f 

y 

(C) STRANDEDNESS: single f_ 
25 <D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

30 

GAT ATT CTG GCC GTG ATG AAT CAC ATG CAG GAG GCT ACA CTT AAT CAT 48 
Asp He Leu Ala Val MET Asn His MET Gin Glu Ala Thr Leu Asn His 
15 10 15 

35 GCG AAG AGT GTG GGA ATT CTA GGA GAT GGA TTC GGG ACG ACG CTA GAG 96 
Ala Lys Ser Val Gly He Leu Gly Asp Gly Phe Gly Thr Thr Leu Glu 
20 25 30 

ATG AGT AAG AGA GAT CTG ATG TGG GTT GTG AGA CGC ACG CAT GTT GCT 144 
40 MET Ser Lys Arg Asp Leu MET Trp Val Val Arg Arg Thr His Val Ala 
35 40 45 

GTG GAA CGG TAC CCT ACT TGG GGT GAT ACT GTA GAA GTA GAG TGC TGG 192 
Val Glu Arg Tyr Pro Thr Trp Gly Asp Thr Val Glu Val Glu Cys Trp 
45 50 55 60 

GAA TGG TGC ATC TGG AAA 210 
Glu Trp Cys He Trp Lys 
65 70 

50 . 
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<ii) MOLECULE TYPE: other nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

5 ACG GOG GAT TAG ATA CAG GGA GGT TTG ACT CCT CGA TGG AAT GAT TTG 48 
Thr Ala Asp Tyr lie Gin Gly Gly Leu Thr Pro Arg Trp Asn Asp .Leu 
15 10 15 

GAT GTC AAT CAG CAT GTG AAC AAC CTC AAA TAC GTT GCC TGG GTT TTT 96 
10 Asp Val Asn Gin His Val Asn Asn Leu Lys Tyr Val Ala Trp Val Phe 

20 25 30 [ 

GAG ACC GTC CCA GAC TCC ATC TTT GAG AGT CAT CAT ATT TCC AGC TTC 144 
Glu Thr Val Pro Asp Ser He Phe Glu Ser His His lie Ser Ser Phe 
15 35 40 45 ~ 

ACT CTT GAA TAC AGG AGA GAG TGC ACG AGG GAT AGC GTG CTG CGG TCC 192 
Thr Leu Glu Tyr Arg Arg Glu Cys Thr Arg Asp Ser Val Leu Arg Ser 
50 55 60 

20 

CTG ACC ACT GTC TCT GGT GGC TCG TCG GAG GCT GGG TTA GTG TGC GAT 240 r 
Leu Thr Thr Val Ser Gly Gly Ser Ser Glu Ala Gly Leu Val Cys Asp h 

65 70 75 80 I 

1- 
t 

25 CAC TTG CTC CAG CTT GAA GGT GGG TCT GAG GTA TTG AGG GCA AGA ACA 288 

His Leu Leu Gin Leu Glu Gly Gly Ser Glu Val Leu Arg Ala Arg Thr i_ 
85 90 95 

GAG TGG AGG CCT AAG CTT ACC GAT AGT TTC AGA GGG ATT AGT GTG ATA 336 
30 Glu Trp Arg Pro Lys Leu Thr Asp Ser Phe Arg Gly He Ser Val He 
100 105 110 

CCC GCA GAA CCG AGG GTG TAACTAATGA AAGAAGCATC TGTTGAAGTT 384 
Pro Ala Glu Pro Arg Val 
35' 115 

TCTCCCATGC TGTTCGTGAG . GATACTTTTT AGAAGCTGCA GTTTGCATTG CTTGTGCAGA 444 
ATCATGGTCT GTGGTTTTAG ATGTATATAA AAAATAGTCC TGTAGTCATG AAACTTAATA 504 

40 r 

TCAGAAAAAT AACTCAATGG GTCAAGGTTA TCGAAGTAGT CATTTAAGCT TTGAATATGT 564 \ 
TTTGTATTCC TCGGCTTAAT CTGTAAGCTC TTTCTCTTGC AATAAAGTTC GCCTTTCG 622 \ 
45 - . 

(2) INFORMATION FOR SEQ ID NO: 36: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 581 base pairs 

(B) TYPE: nucleic acid 

55 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA to itiRNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 
CTTCAAGGGG GTTGGACTCC GCGATGGAAT GATTTGGATG TCAATCAGCA CGTGAACAAT 60 



50 



60 
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ATCAAATACT TGGCTGGATT TTTAAGAGCG TCCCAGACTA TATCTATGAG AATCATCATC 120 
TTTCTAGCAT CACTCTCGAA TACAGGAGAG AGTGCACAAG GGGCAGAGCA ACTGCAGTCC 180 

5 

CTGACCACTG TTTGTGGTGG CTCGTCCGAA GCTGGGGTCA TATGTGAGCA CCTACTCCAG 240 
CTTGAGGATG GGTCTGAGGT TTTGAGGGCA AGAACAGATT GGGAGGCCCA AGCGCACCGC 300 
10 ATAGTTTCGA AGGCATTAGT GAGAGATTCC CGCAGCAAGA ACCGGCGTAA TTAATGACAG 360 
AAGCATCAGA TATAGTTTCT CCTGTGCTGT TCCTGAGAAT GCATCTTACA AGTCGTGGTT 420 
TGGATTGCTT GTGCAGAATC ATGGTTTGTG CTTTCAGAAG TACATCTAAA TTAGTCCAAG 480 

15 

TTATATGACT CCATATTGGA AAATAACTCG ATGAGTCGTG CTCTTGAAAT GGTCTTTTAA 540 
GCTTTGAAAT AAAGTACCAC TTAATCCAAA AAAAAAAAAA A 581 

20 

(2) INFORMATION FOR SBQ ID NO: 37; 
(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 370 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 

30 

(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

35 Met Lys Ala Val Met Leu Ala Arg Asp Gly Arg Gly Met Lys Pro Arg 
1 5 10 15 

Ser Ser Asp Leu Gin Leu Arg Ala Gly Asn Ala Pro Thr Ser Leu Lys 
20 25 30 



40 
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Met lie Asn Gly Thr Lys Phe Ser Tyr Thr Glu Ser Leu Lys Arg Leu 
35 40 45 



Pro Asp Trp Ser Met Leu Phe Ala Val lie Thr Thr lie Phe Ser Ala 
45 50 55 60 

Ala Glu Lys Gin Trp Thr Asn Leu Glu Trp Lys Pro Lys Pro Lys Leu 
65 70 75 80 

50 Pro Gin Leu Leu Asp Asp His Phe Gly Leu His Gly Leu Val Phe Arg 

85 90 95 



Arg Thr Phe Ala lie Arg Ser Tyr Glu Val Gly Pro Asp Arg Ser Thr 

100 105 110 

Ser He Leu Ala Val Met Asn His Met Gin Glu Ala Thr Leu Asn His 
115 120 125 



Ala Lys Ser Val Gly He Leu Gly Asp Gly Phe Gly Thr Thr Leu Glu 
60 130 135 140 

Met Ser Lys Arg Asp Leu Met Trp Val Val Arg Arg Thr His Val Ala 
145 150 155 160 



Val Glu Arg Tyr Pro Thr Trp Gly Asp Thr Val Glu Vai Giu Cys Trp 
165 170 175 

5 He Gly Ala Ser Gly Asn Asn Gly Met Arg Arg Asp Phe Leu Val Arg 
180 185 190 

Asp Cys Lys Thr Gly Glu He Leu Thr Arg Cys Thr Ser Leu Ser Val 
195 200 205 

10 

Leu Met Asn Thr Arg Thr Arg Arg Leu Ser Thr He Pro Asp Glu Val 
210 215 220 

Arg Gly Glu He Gly Pro Ala Phe He Asp Asn Val Ala Val Lys Asp 
15 225 230 235 240 

Asp Glu He Lys Lys Leu Gin Lys Leu Asn Asp Ser Thr Ala Asp Tyr 
245 250 255 

20 He Gin Gly Gly Leu Thr Pro Arg Trp Asn Asp Leu Asp Val Asn Gin 
260 265 270 

His Val Asn Asn Leu Lys Tyr Val Ala Trp Val Phe Glu Thr Val Pro ^ 
275 280 285 

25 

Asp Ser He Phe Glu Ser His His He Ser Ser Phe Thr Leu Glu Tyr 
290 295 300 



r 



Arg Arg Glu Cys Thr Arg Asp Ser Val Leu Arg Ser Leu Thr Thr Val 
30 305 310 315 320 

Ser Gly Gly Ser Ser Glu Ala Gly Leu Val Cys Asp His Leu Leu Gin 
325 330 335 

35 Leu Glu Gly Gly Ser Glu Val Leu Arg Ala Arg Thr Glu Trp Arg Pro 
340 345 350 

Lys Leu Thr Asp Ser Phe Arg Gly He Ser Vai He Pro Ala Glu Pro 
355 360 365 

40 

Arg Val ! 
370 

45 (2) INFOPMATION FOR SEQ ID NO: 38; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1561 base pairs 

50 

(B) TYPE: nucleic acid 

(C) STRiJJNDEDNESS: single 
55 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to inRNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

60 

AGAGAGAGAG AGAGAGAGAG AGCTAAATTA AAAAAAAAAC CCAGAAGTGG 6AAATCTTCC 60 
CCATGAAATA ACGGATCCTC TTGCTACTGC TACTACTACT ACTACAAACT GTAGCCATTT 120 
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ATATAATTCT ATATAATTTT CAACRTGGCC ACCACCTCTT TAGCTTCCGC TTTCTGCTCG 180 

ATGAAAGCTG TAATGTTGGC TCGTGATGGC CGGGGCATGA AACCCAGGAG CAGTGATTTG 240 

5 

CAGCTGAGGG CGGGAAATGC GCCAACCTCT TTGAAGATGA TCAATGGGAC CAAGTTCAGT 300 

TACACGGAGA GCTTGAAAAG GTTGCCTGAC TGGAGCATGC TCTTTGCAGT GATCACAACC 360 

10 ATCTTTTCGG CTGCTGAGAA GCAGTGGACC AATCTAGAGT GGAAGCCGAA GCCGAAGCTA 420 

CCCCAGTTGC TTGATGACCA TTTTGGACTG CATGGGTTAG XTTTCAGGCG CACCTTTGCC 480 

ATCAGATCTT ATGAGGTGGG ACCTGACCGC TCCACATCTA TACTGGCTGT TATGAATCAC 540 

15 

ATGCAGGAGG CTACACTTAA TCATGCGAAG AGTGTGGGAA TTCTAGGAGA TGGATTCGGG 600 

ACGACGCTAG AGATGAGTAA GAGAGATCTG ATGTGGGTTG TGAGACGCAC GCATGTTGCT 660 

20 GTGGAACGGT ACCCTACTTG GGGTGATACT GTAGAAGTAG AGTGCTGGAT TGGTGCATCT .720 

GGAAATAATG GCATGCGACG TGATTTCCTT GTCCGGGACT GCAAAACAGG CGAAATTCTT 780 

ACAAGATGTA CCAGCCTTTC GGTGCTGATG AATACAAGGA CAAGGAGGTT GTCCACAATC 840 

25 

CCTGACGAAG TTAGAGGGGA GATAGGGCCT GCATTCATTG ATAATGTGGC TGTCAAGGAC 900 

GATGAAATTA AGAAACTACA GAAGCTCAAT GACAGCACTG CAGATTACAT CCAAGGAGGT 960 

30 TTGACTCCTC GATGGAATGA TTTGGATGTC AATCAGCATG TGAACAACCT CAAATACGTT 1020 

GCCTGGGTTT TTGAGACCGT CCCAGACTCC ATCTTTGAGA GTCATCATAT TTCCAGCTTC 1080 

ACTCTTGAAT ACAGGAGAGA GTGCACGAGG GATAGCGTGC TGCGGTCCCT GACCACTGTC 1140 

35 

TCTGGTGGCT CGTCGGAGGC TGGGTTAGTG TGCGATCACT TGCTCCAGCT TGAAGGTGGG 1200 

TCTGAGGTAT TGAGGGCAAG AACAGAGTGG AGGCCTAAGC TTACCGATAG TTTCAGAGGG 1260 

40 ATTAGTGTGA TACCCGCAGA ACCGAGGGTG TAACTAATGA AAGAAGCATC TGTTGAAGTT 1320 

TCTCCCATGC TGTTCGTGAG GATACTTTTT AGAAGCTGCA GTTTGCATTG CTTGTGCAGA 1380 

ATCATGGTCT GTGGTTMAG ATGTATATAA AAAATAGTCC TGTAGTCATG AAACTTAATA 1440 

45 

TCAGAAAAAT AACTCAATGG GTCAAGGTTA TCGAAGTAGT CATTTAAGCT TTGAAATATG 1500 

TTTTGTATTC CTCGGCTTAA TCTGTAAGCT CTTTCTCTTG CAATAAAGTT CGCCTTTCAA 1560 

50 T 1561 
(2) INFORMATION FOR SBQ ID NO: 39: 
(i) SEQUENCE CHARACTERISTICS: 

55 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

60 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



lie 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

Leu Glu Trp Lys Pro Lys Pro Lys Leu Pro Glu Leu Leu Asp 
1 5 10 

5 

(2) INFORMATION FOR SEQ ID NO: 40: 

i 

(i) SEQUENCE CHARACTERISTICS: j 

P 

10 (A) LENGTH: 28 amino acids 1 

(B) TYPE: amino acid t" 
(D) TOPOLOGY: linear 

15 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

20 Leu Leu Asp Asp His Phe Gly Leu His Gly Leu Val Phe Arg Arg Thr 

15 10 15 - 



Phe Ala He Arg Ser Tyr Glu Val Gly Pro Asp Arg 
20 25 



£=5 

r 



25 



I 
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What is claimed is: 

1- A «oon*ina„t DNA construct comprising at least 
a portion o^ a plant thioesterase encoding se^ence. 

2. The construct ot Clai. l -encoding a biologically 
5 active plant thioesterase. 

3. The construct of Claim 1 wherein said sequence 
" coined to a second nucleic acid sequence which Is not 

. naturally joined to said first sequence. 

4. The construct of Claim 1 wherein said sequence 
10 IS joined to a label. 

5. The construct of Claim i wherein said sequence 
encodes a precursor plant thioesterase. 

6- The construct of Claim 1 comprising, m the 5' 
to 3. direction of transcription, a transcriptional 
15 regulatory region functional in a host cell and said 
sequence. 

-"'truct of Claim 6 Wherein said sequence 
IS a sense sequence. 

8. The construct of Claim 6 wherein said sequence 
20 IS an anti-sense sequence. 

9- The construct of claim 6 further co^rlslng, a 
translatlonal regulatory region functional in a host cell 
i™.ediately to said sequence and a tr«,scriptlonal/ 
translatlonal termination regulatory region 3- to said 
25 sequence . 

10. The construct Of Claim 1 comprising an 
expression cassette capable of producing a plant 
thioesterase In a host cell co^rislng, iu the 5' to 3' 
direction of transcription, a trahscriptlonal initiation 
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regulatory region functional in said host cell^ a 
translational initiation regulatory region functional in 
said host cell, a DNA sense sequence encoding a ; 
biologically active plant thioesterase, and a 
5 transcriptional and translational teinnination regulatory i 
region functional in said host cell, wherein said plant ^ 
thioesterase encoding sequence is under the control of said 
regulatory regions. 

11. The construct of Claim 10 wherein said host cell 

10 is a plant cell. t 

12. The construct of claim 11 wherein said | 
transcriptional initiation region is obtained from a gene i 
preferentially expressed in plant seed tissue. I 

13. The construct of Claim 1 wherein said plant 

15 thioesterase demonstrates preferential hydrolysis activity 
toward fatty acyl-acyl carrier protein (ACP) substrates. 

14. The construct of Claim 1 wherein said plant 
thioesterase demonstrates preferential hydrolysis activity 

toward one or more medium- chain fatty acyl-carrier I 
20 substrates . ^ 

15 . The construct of Claim 1 wherein said sequence 
is obtainable from Umbellularla californica (Bay) , Cuphea 
hookeriana (Cuphea) or Carthamus tinctlorius (safflower) . 

16. A host cell comprising a plant thioesterase 
25 encoding sequence construct of any one of Claims 1-15. 

17. The cell of claim 16 wherein said cell is a i 
plant cell. - 

18. The cell of Claim 17 wherein said plant cell is 
in vivo. 
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19. The cell of Claim 17 wherein said plant cell is 
a Brassica plant cell, 

20. A transgenic host cell comprising an expressed 
plant thioesterase . 

21. * The cell of Claim 20 wherein said host cell is a 
plant cell, 

22 • The cell of Claim 21 wherein said plant 
thioesterase is a medium-chain preferring acyl-carrier 
thioesterase. 

23. The cell of Claim 22 wherein an untransformed 
plant cell is free of medium-chain preferring acyl-carrier 
thioesterase . 

24 . A method of producing a plant thioesterase in a 
host cell or progeny thereof comprising 

growing a host cell or progeny thereof comprising a 
construct of any one of Claims 10-12, under conditions 
which will permit the production of said plant 
thioesterase. 

25. The method of Claim 24 wherein said host cell is 
a plant cell and said construct is integrated into the 
genome of said plant cell. 

26. The method of Claim 25 wherein said plant cell 
is In vivo. 

27. A host cell comprising a plant thioesterase 
produced according to Claim 24. 

28. A cell of Claim 27 wherein said host cell is a 
plant host cell and said construct is integrated into the 
genome of said plant cell. 



29. A plant thioesterase being substantially free 
from other plant proteins and having preferential hydrolase 
activity toward one or more medium-chain fatty acid acyl- 
carrier substrates. 
5 30. The thioesterase of Claim 29 wherein said 

carrier is acyl carrier protein ("ACP") . 

31. The thioesterase of Claim 29 comprising a C12- 
pref erring acyl-ACP thioesterase. 

32. The thioesterase of Claim 29 comprising a ClO- 
10 preferring acyl-ACP thioesterase. 

33. The thioesterase of Claim 29 obtainable from an 
immature plant embryo. 

34. The thioesterase of Claim 29 comprising a C12- 
preferring acyl-ACP thioesterase obtainable from 

15 Uinbellularia calif ornica. 

35. The thioesterase of Claim 34 having a molecular 
weight of approximately 34 KDa. 

36. The thioesterase of Claim 29 obtainable from 
Cuphea . 

20 37. The thioesterase of Claim 36 comprising a ClO- 

preferring acyl-ACP thioesterase obtainable from Cuphea 
hookeriana . 

38. A method of producing a medium-chain free fatty 
acid coirqprising the steps of contacting, under enzyme 
25 reactive conditions, (1) a plant thioesterase having 

preferential hydrolase activity toward one or more medium- 
chain fatty acid acyl-carrier substrates and (2) a fatty 
acid acyl-carrier substrate capable of hydrolysis by said 
plant thioesterase, wherein said contacting is outside a 



plant naturally associated with said thioesterase, and 
whereby a medium-chain fatty acid is released from said 
carrier, 

39. The method of Claim 38 wherein said plant 
thioesterase is obtainable from Umbellularia calif ornica. 

.40. The method of Claim 38 where said plant 
thioesterase is a Umbellularia californica C12-preferring 
acyl-ACP thioesterase . 

41. The method of Claim 38 wherein said plant 
thioesterase is obtainable from Cuphea hookeriana. 

42. The method of Claim 38 wherein said plant 
thioesterase is a Cuphea hookeriana ClO-pref erring acyl-ACP 
thioesterase. 

43. The method of Claim 38 wherein said contacting 
is carried out in vitro. 

44. A method of modifying the free fatty acids in a 
plant cell from a given percentage of longer chain fatty 
acids to shorter chain fatty acids comprising: 

growing a plant cell having integrated in its genome a 
DNA construct, said construct comprising in the 5' to 3' 
direction of transcription, a transcriptional regulatory 
region functional in said plant cell and a plant 
thioesterase encoding sequence, under conditions which will 
permit the transcription of said plant thioesterase. 

45. The method of Claim 44 wherein said plant 
thioesterase encoding sequence is an anti-sense sequence. 

46. The method of Claim 44 wherein said construct 
further comprises a translational regulatory region 
functional in said plant cell immediately 5 ' to said plant 




thioesterase encoding sequence and a transcriptional/ 
translational termination regulatory region 3' to said 
sequence and wherein said plant thioesterase encoding i 
sequence is a sense sequence. I 
5 47. The method of Claim 44 wherein said plant cell 

is an oilseed embryo plant cell. 

48. The method of Claim 44 wherein the percentage of 
long chain free fatty acids is decreased and the percentage . 
of medium-chain free fatty acids is increased. 
10 49. A plant cell having a modified free fatty acid k 

composition produced according to the method of any one of h 
Claims 44-48. L 

50. A plant seed having a modified level of fatty I 
acids as compared to a seed of said plant having a native 

15 level of fatty acids produced according to a method 
comprising 

growing a plant, having integrated into the 
genome of embryo cells a recombinant DNA sequence encoding ^ 
a plant thioesterase under the control of regulatory » 
20 elements functional in seed during lipid accumulation, to L 
produce seed under conditions which will promote the 
activity of said regulatory elements, and harvesting said 
seed, 

51. The seed of Claim 50 wherein said plant is 

25 Brassica napus. ] 

52. The seed of Claim 50 wherein said seed is an 
oilseed. 

53. A method of modifying the fatty acid composition 
of triglycerides produced from an oilseed crop plant from a 



given percentage of longer chain fatty acids to shorter 
chain fatty acids comprising: 

growing a plant cell having integrated in its 
genome a DNA construct, said construct comprising in the 5* 
to 3' direction of transcription , a transcriptional 
regulatory region functional in said plant cell and a plant 
thioesterase encoding sequence, under conditions which will 
permit the transcription of said plant thioesterase. 

54. The method of Claim 53 wherein said plant 
thioesterase encoding sequence is an anti-sense sequence. 

55. The method of Claim 53 wherein said construct 
further comprisies a translational regulatory region 
functional in said plant cell immediately 5' to said plant 
thioesterase encoding sequence and a transcriptional/ 
translational termination regulatory region 3' to said 
sequence and wherein said plant thioesterase encoding 
sequence is a sense sequence. 

56. The method of Claim 53 wherein said plant cell 
is an oilseed embryo plant cell. 

57. The method of Claim 53 wherein the percentage of 
long chain fatty acids is decreased and the percentage of 
medium-chain fatty acids is increased. 

58. A plant cell having a modified fatty acid 
composition of triglycerideds produced according to the 
method of any one of Claims 53-57, 

59. The method of any one of Claims 53 to 57 wherein 
said crop plant is selected from the group consisting of 
rapeseed, sunflower, safflower, cotton, cuphea, soybean, 
peanut, coconut, oil palm and corn. 



60. A plant seed oil of a plant having an endogenous 
proportion of longer chain fatty acids to shorter chain 
fatty acids comprising a plant seed oil having a modified 
proportion of longer chain fatty acids to shorter chain 
5 fatty acids. 

; 61. The oil of Claim 60 comprising a Brassica napus - 

[ oil. 1 

62. A plant seed oil separated from a seed produced 
according to any one of Claims 50-52. 
i 10 63. A triglyceride oil from a plant selected from 

1: the group consisting of rapeseed, sunflower, cotton, t 

] cuphea, soybean, peanut, coconut^ oil palm and corn wherein 1^ 

3 the fatty acid chain length composition of the oil has been ; 

modified from a given percentage of longer chain fatty 
15 acids to shorter chain fatty acids by a method comprising: 
growing a plant cell have integrated in its 
genome a DNA construct, said construct comprising in the 5* 
to 3' direction of transcription, a transcriptional 

4- 

i 

regulatory region functional in said plant cell and a plant 
20 thioesterase encoding sequence, under conditions which will : 
permit the transcription of said plant thioesterase. 

64. The oil of Claim 63 wherein said plant 
thioesterase encoding sequence is an anti-sense sequence. 

65, The oil of Claim 64 wherein said construct 
25 further comprises a translational regulatory region : 

functional in said plant cell immediately 5' to said plant 
thioesterase encoding sequence and a : 
transcriptional/translational termination regulatory region 



3* to said sequence and wherein said plant thioesterase 
encoding sequence is a sense sequence. 

66. The oil of Claim 65 wherein the percentage of 
long chain fatty acids is decreased and the percentage of 
medium-chain free fatty acids is increased. 
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